DNA sequence surrounding the glucocerebrosidase gene

ABSTRACT

The present invention provides the discovery and isolation of the nucleotide sequence of the human clk2, the human propin1, and the human cote1 genes that are located within the glucocerebrosidase gene locus. Also provided by the present invention are proteins or polypeptides encoded by those genes, nucleic acids encoding those polypeptides, and antibodies to those proteins. Further provided by the present invention are nucleic acids of the same apparent molecular size as the propin1 and cote1 gene regions and which have the same restriction pattern as the wild-type genes.

BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention

[0002] This invention relates generally to genes and nucleic acidssurrounding the glucocerebrosidase gene (GBA). The invention alsogenerally relates to genes and nucleic acids encoding clk2, cote1 andpropin1, and the proteins encoded therein.

[0003] 2. Background Art

[0004] Gaucher disease, the inherited deficiency of the enzymeglucocerebrosidase (EC 3.2.1.45), is the most common lysosomal hydrolasedeficiency. The gene for glucocerebrosidase (GBA) is located onchromosome 1q21 (Ginns et al. 1985. “Gene mapping and leader polypeptidesequence of human glucocerebrosidase: implications for Gaucher disease.”Proc. Natl. Acad. Sci. USA 82: 7101-7105.) and is composed of 11 exons(Horowitz et al. 1989. “The human glucocerebrosidase gene andpseudogene: structure and evolution.” Genomics 4:87-96.). A highlyhomologous pseudogene (psGBA) is located nearby (Choudary et al.1985.“Molecular cloning and analysis of the human -glucocerebrosidase gene.”DNA 4:74.), and has contributed significantly to the origin of mutationsin glucocerebrosidase (Tsuji et al. 1987. “A mutation in the humanglucocerebrosidase gene in neuronopathic Gaucher's disease.” N. Engl. J.Med. 316: 570-575.). Cormand et al. have recently provided alocalization of this region relative to 6 markers from the Généthonhuman linkage map (cormand et al. 1997. “Genetic fine localization ofthe -glucocerebrosidase (GBA) and prosaposin (PSAP) genes: implicationsfor Gaucher disease.” Hum. Genet. 100:75-79.). Analyses of the mutationspresent in patients have revealed both single missense mutations(Beutler et al. 1997. “Hematologically important mutations: Gaucherdisease” Blood Cells Mol. & Dis. 23:2-7.) and recombinant alleles,including several mutations which originate from the pseudogene sequence(Latham et al.1990. “Complex alleles of the acid-glucocerebrosidase genein Gaucher disease.” Am. J. Hum. Genet. 47:79-86; Eyal et al. 1990.“Prevalent and rare mutations among Gaucher patients.” Gene 96:277-283.). Patients have also been described with alleles resulting froma fusion between GBA and psGBA (Zimran et al 1990. “A glucocerebrosidasefusion gene in Gaucher disease. Implications for the molecular anatomy,pathogenesis, and diagnosis of this disorder.” J. Clin. Invest. 85:219-222.).

[0005] Many attempts have been made to correlate patient genotypes withthe clinical presentation of Gaucher disease. While there is somepredictive value of certain alleles for either mild or severe disease(Zimran et al. 1989. “Prediction of severity of Gaucher's disease byidentification of mutations at DNA level.” Lancet 2: 349-352; Beutlerand Grabowski, 1995. “Gaucher disease. In The Metabolic Basis ofinherited disease (eds. C. R. Scriver, A. L. Beaudet, W. S. Sly, D.Valle), pp. 2641-2670.” McGraw-Hill Information Services Co., HealthProfessions Division, New York), no specific symptom complex can becorrelated with a unique genotype (Sidransky et al. 1994. “DNAmutational analysis of type 1 and type 3 Gaucher patients: How well domutations predict phenotype? Hum. Mutat. 3: 25-28.). Based on clinicalpresentation, Gaucher disease has been divided into three types. Type 1patients have very heterogeneous presentations, ranging fromasymptomatic adults to young children with severe hepatosplenomegaly andbone involvement. Type 2 is invariably fatal, with infants classicallydeveloping symptoms at two to six months and dying by two years of age(Frederikson et al. 1972. “Glucosylceramide lipidoses: Gaucher'sdisease.” In The Metabolic Basis of inherited Disease (eds. J. B.Stanbury, J. B. Wyngarden, and D. S. Frederickson), pp.730-759.McGraw-Hill International Book Co., New York.). More recently, thesevere phenotype of a knockout mouse model of Gaucher disease(Tybulewicz et al. 1992. “Animal model of Gaucher's disease fromtargeted disruption of the mouse glucocerebrosidase gene.” Nature 357:407410.) prompted the recognition of a subset of severely affected type2 patients which present and die in the perinatal period (Sidransky etal. 1992. “Gaucher disease in the neonate: A distinct Gaucher phenotypeis analogous to a mouse model created by targeted disruption of theglucocerebrosidase gene.” Pediatr. Res. 32: 494498.). Type 3 is ofintermediate severity, and includes patients with varying degrees ofneurological impairment that develops in childhood or early adulthood.

[0006] A recent attempt to generate a point mutation mouse model ofGaucher disease led to the discovery of a novel gene, metaxin (MTX),which in the mouse is contiguous to and transcribed convergently toglucocerebrosidase. Metaxin shares a bidirectional promoter with thegene for thrombospondin 3. The insertion of a neomycin resistancecassette in the 3′ flanking region of GBA resulted in a knockout of themurine metaxin gene (Bornstein et al. 1995. “Metaxin, a gene contiguousto both thrombospondin 3 and glucocerebrosidase, is required forembryonic development in the mouse: Implications for Gaucher disease.”Proc. Natl. Acad. Sci. 92: 4547-4551.). Metaxin is a component of theprotein translocation apparatus of the mitochondrial outer membrane(Armstrong et al. 1997. “Metaxin is a component of a preprotein importcomplex in the outer membrane of the mammalian mitochondrion.” J BiolChem 272: 6510-6518.). Homozygosity for the metaxin knockout results inan embryonic lethal phenotype. Human metaxin is located downstream topsGBA and a pseudogene for metaxin was subsequently identifieddownstream to GBA in the intergenic region (Long et al. 1996. “Structureand organization of the human metaxin gene (MTX) and pseudogene.”Genomics 33; 177-184.). The region downstream to psGBA encodes formetaxin, thrombospondin 3, (Thbs3) (Vos et al. 1992. “Thrombospondin 3(Thbs3), a new member of the thrombospondin gene family.” J. Biol. Chem.267: 12192-12196.; Adolph et al 1995. “Structure and organization of thehuman thrombospondin 3 gene (THBS3).” Genomics 27: 329-336.), andpolymorphic epithelial mucin 1 (Muc1) (Ligtenberg et al 1990.“Episialin, a carcinoma-associated mucin, is generated by a polymorphicgene encoding splice variants with alternative amino termini.” J. Biol.Chem. 265: 5573-5578.; Vos et al. 1995. “A tightly organized conservedgene cluster on mouse chromosome 3 (E3-F1).” Mamm. Genome 6: 820-822.).There is, therefore, an association of genes surrounding theglucocerebrosidase gene with clinical implications of Gaucher's disease.In order to elucidate these interactions, it is first necessary todiscover the identity of the sequences in the area of theglucocerebrosidase gene. This invention provides that essentialinformation.

[0007] This invention therefore provides the genomic DNA sequence and amore detailed organization of a 75 kb region around theglucocerebrosidase locus, including the duplicated region containingglucocerebrosidase and metaxin. Also provided are the origin andendpoints of the duplication leading to the pseudogenes for GBA and MTX.The invention further provides three new genes within the 32 kb ofsequence upstream to GBA. Due to the close proximity of these threegenes to GBA, common locus control, regulatory elements and overlappingtranscripts may coordinately affect expression of these genes and, inthe case of Gaucher's disease, influence the expression level ofglucocerebrosidase. The potential involvement of contiguous gene effectscould explain the more unusual phenotypes encountered among Gaucherpatients. Of these three genes, the gene most distal to GBA is a proteinkinase (clk2). The second gene,propin1, shares homology to a ratSecretory CArrier Membrane Protein (SCAMP37). Finally, the cote1 genelies most proximal to GBA. Also provided are the nucleic acids encodingthese polypeptides and the proteins encoded therein.

SUMMARY OF THE INVENTION

[0008] The present invention provides the discovery and isolation of thenucleotide sequence of the human clk2, the human propin1, and the humancote1 genes that are located within the glucocerebrosidase gene locus.

[0009] Also provided by the present invention are proteins orpolypeptides encoded by those genes, nucleic acids encoding thosepolypeptides, and antibodies to those proteins.

[0010] Further provided by the present invention are nucleic acids ofthe same apparent molecular size as the propin1 and cote1 gene regionsand which have the same restriction pattern as the wild-type genes.

BRIEF DESCRIPTION OF THE FIGURE

[0011]FIG. 1 shows a restriction map of the 5′ sequence flanking theglucocerebrosidase gene. The location of the clk2, propin1 and cote1genes are indicated as follows: the 3′ end of clk2 corresponds tonucleic acid 10482, the 5′ and 3′ ends of propin1 correspond to nucleicacids 10950 and 17388, respectively, and the 5′ and 3′ ends of cote1correspond to nucleic acids 17913 and 26165, respectively.

DETAILED DESCRIPTION OF THE INVENTION

[0012] The present invention may be understood more readily by referenceto the following detailed description of the preferred embodiments ofthe invention and the Example included therein.

[0013] Before the present compounds and methods are disclosed anddescribed, it is to be understood that this invention is not limited tospecific proteins, specific methods, or specific nucleic acids, as suchmay, of course, vary. It is also to be understood that the terminologyused herein is for the purpose of describing particular embodiments onlyand is not intended to be limiting.

[0014] It must be noted that, as used in the specification and theappended claims, the singular forms “a,” “an,” and “the” include pluralreferents unless the context clearly dictates otherwise. Thus, forexample, reference to “a nucleic acid” includes multiple copies of thenucleic acid and can also include more than one particular species ofmolecule.

[0015] In one aspect, the invention relates to a nucleic acid comprisingthe nucleic acid set forth in the Sequence Listing as SEQ ID NO: 1. Thenucleic acid encodes the genes for human clk2, human propin1, humancote1, human GBA, human psMTX, human psGBA, human MTX and human Thbs3.The nucleic acid encodes human genes, which includes sequences both 5′and 3′ to the coding regions of the genes. The nucleic acid set forth inSEQ ID NO: 1 represents the sequence of a genomic clone and thereforeincludes introns of the encoded genes.

[0016] In another aspect, the invention relates to a nucleic acidcomprising the nucleic acid set forth in the Sequence Listing as any ofSEQ ID NO: 2, SEQ ID NO: 4, and SEQ ID NO: 6 which correspond to thecoding regions of the genes for human clk2, human propin1 and humancote1, respectively. These nucleic acids encode these respective genes,but do not include sequences both 5′ and 3′ to the coding region. Thenucleic acid set forth in any of SEQ ID NO: 2, SEQ ID NO: 4 and SEQ IDNO: 6, therefore does not include any introns of these human genes.

[0017] The invention further provides a nucleic acid encoding thepolypeptide set forth in any of SEQ ID NO: 3, SEQ ID NO: 5 and SEQ IDNO: 7, which are the clk2, propin1, and cote1 proteins encoded by thenucleic acids in SEQ ID NO: 2, 4, and 6. The invention further providesthose nucleic acids encoding the proteins provided herein. Furtherprovided by the present invention are genomic sequences corresponding tothe propin1 unspliced RNA, the cote1 unspliced RNA, and the clk2unspliced RNA, namely SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10,respectively.

[0018] As used herein, the term “nucleic acid” refers to single-ormultiple stranded molecules which may be DNA or RNA, or any combinationthereof, including modifications to those nucleic acids. The nucleicacid may represent a coding strand or its complement, or any combinationthereof Nucleic acids may be identical in sequence to the sequenceswhich are naturally occurring for any of the novel genes discussedherein or may include alternative codons which encode the same aminoacid as that which is found in the naturally occurring sequence. Thesenucleic acids can also be modified from their typical structure. Suchmodifications include, but are not limited to, methylated nucleic acids,the substitution of a non-bridging oxygen on the phosphate residue witheither a sulfur (yielding phophorothioate deoxynucleotides), selenium(yielding phosphorselenoate deoxynucleotides), or methyl groups(yielding methylphosphonate deoxynucleotides).

[0019] Similarly, one skilled in the art will recognize that compoundscomprising the genes, nucleic acids, and fragments of the genes andnucleic acids as disclosed and contemplated herein are also provided.For example, a compound comprising a nucleic acid can be a derivative ofa typical nucleic acid such as nucleic acids which are modified tocontain a terminal or internal reporter molecule and/or those nucleicacids containing non-typical bases or sugars. These reporter moleculesinclude, but are not limited to, isotopic and non-isotopic reporters.Therefore any molecule which may aid in detection, amplification,replication, expression, purification, uptake, etc. may be added to thenucleic acid construct.

[0020] The term “gene” as used herein means a unit of heredity thatoccupies a specific locus on a chromosome as well as any sequencesassociated with the expression of that nucleic acid. For example, thegene includes any introns normally present within the coding region aswell as regions preceding and following the coding region. Examples ofthese non-coding regions include, but are not limited to transcriptiontermination regions, promoter regions, enhancer regions and modulationregions.

[0021] The regions upstream and downstream of GBA may act as promoterfactors for GBA expression and may be involved in the pathophysiology ofGaucher's disease. Additionally, the regions flanking GBA may containpotential cis-active elements that are required for regulation of GBAgene transcription. The genomic locus described herein may also encodeoverlapping sense and antisense RNAs. Such transcripts may be involvedin the regulation of GBA expression and may play a role in thedevelopment of Gaucher's disease, through the activity of the respectivegene products alone, or in combination with the other gene products,typically those surrounding the GBA gene. Antisense control of sensetranscripts may be exerted at the level of transcription, maturation,transport, stability and translation of GBA. For example, the 3′ UTR ofGBA may contain a region of antisense complementarity to a region ofclk2, propin1 or cote1, whereby antisense interaction regulates theactivity of the respective gene. Where sufficient mutations within thesegenes occur, this interaction is then be disrupted causing GBA to beincorrectly transcribed, processed, transported, translated etc.Alternatively, the clk2, propin1, and cote1 genes may encode proteinsthat covalently or noncovalently associate with GBA. Mutations withinthe genes contemplated in this invention may therefore result in mutantproteins that are deficient in their ability to effect thisprotein-protein interaction.

[0022] To identify protein-protein interactions between the genescontemplated in this invention and GBA or other proteins, conventionalyeast two hybrid assays can be utilized. These procedures employcommercially available kits (e.g., Matchmaker* from Clontech, Palo Alto,Calif.) and involve fusing the “bait” (in this example, all or part ofclk2, propin1 or cote1) to a DNA binding protein, such as GAL4, andfusing the “target” (e.g. a cDNA library) to an activating protein, suchas the activation element domain of VP-16. Both constructs are thentransformed into yeast cells containing a selectable marker gene underthe control of, in this example, a GAL4 binding element. Thus, onlythose yeast colonies containing cDNAs of proteins or protein fragmentsthat interact with the GALA4-bait constructs will be detected. The cDNAof the target construct is isolated from positive clones andconventional methods used to isolate the cDNA encoding the protein orprotein fragment that interacts with the GAL4-bait construct.

[0023] The genes and nucleic acids provided for by the present inventionmay be obtained in any number of ways. For example, a DNA moleculeencoding clk2, propin1 or cote1 can be isolated from the organism inwhich it is normally found. For example, a genomic DNA or cDNA librarycan be constructed and screened for the presence of the gene or nucleicacid of interest. Methods of constructing and screening such librariesare well known in the art and kits for performing the construction andscreening steps are commercially available (for example, StratageneCloning Systems, La Jolla, calif.). Once isolated, the gene or nucleicacid can be directly cloned into an appropriate vector, or if necessary,be modified to facilitate the subsequent cloning steps. Suchmodification steps are routine, an example of which is the addition ofoligonucleotide linkers which contain restriction sites to the terminiof the nucleic acid. General methods are set forth in Sambrook et at.“Molecular Cloning, a Laboratory Manual,” Cold Spring Harbor LaboratoryPress (1989).

[0024] Another example of a method of obtaining a DNA molecule encodinga specific gene, CDS, mRNA or protein of the present invention is tosynthesize a recombinant DNA molecule which encodes that protein. Forexample, oligonucleotide synthesis procedures are routine in the art andoligonucleotides coding for a particular protein region are readilyobtainable through automated DNA synthesis. A nucleic acid for onestrand of a double-stranded molecule can be synthesized and hybridizedto its complementary strand. One can design these oligonucleotides suchthat the resulting double-stranded molecule has either internalrestriction sites or appropriate 5′ or 3′ overhangs at the termini forcloning into an appropriate vector. Double-stranded molecules coding forrelatively large proteins can readily be synthesized by firstconstructing several different double-stranded molecules that code forparticular regions of the protein, followed by ligating these DNAmolecules together. For example, Cunningham, et al. “Receptor andAntibody Epitopes in Human Growth Hormone Identified by Homolog-ScanningMutagenesis,” Science, 243:1330-1336 (1989), have constructed asynthetic gene encoding the human growth hormone gene by firstconstructing overlapping and complementary synthetic oligonucleotidesand ligating these fragments together. See also, Ferretti, et al. Proc.Nat. Acad. Sci. 82:599-603 (1986), wherein synthesis of a 1057 base pairsynthetic bovine rhodopsin gene from synthetic oligonucleotides isdisclosed. By constructing the desired sequence in this manner, oneskilled in the art can readily obtain any particular protein such asclk2, propin1, or cote1, with desired amino acids at any particularposition or positions within the protein. See also, U.S. Pat. No.5,503,995 which describes an enzyme template reaction method of makingsynthetic genes. Techniques such as this are routine in the art and arewell documented. These nucleic acids can then be expressed in vivo or invitro as discussed below.

[0025] Once the gene or nucleic acid sequence of the desired gene isobtained, the sequence encoding specific amino acids can be modified orchanged at any particular amino acid position by techniques well knownin the art. For example, PCR primers can be designed which span theamino acid position or positions and which can substitute any amino acidfor another amino acid. Then a nucleic acid can be amplified andinserted into the wild-type coding sequence in order to obtain any of anumber of possible combinations of amino acids at any position of thegene. Alternatively, one skilled in the art can introduce specificmutations at any point in a particular nucleic acid sequence throughtechniques for point mutagenesis. General methods are set forth inSmith, “In vitro mutagenesis” Ann. Rev. Gen., 19:423-462 (1985) andZoller, “New molecular biology methods for protein engineering” Curr.Opin. Struct Biol., 1:605-610 (1991). Techniques such as these can alsobe used to modify the genes or nucleic acids in regions other than thecoding regions, such as the promoter regions or any regulatory ornoncoding region.

[0026] As used herein, the term “isolated” refers to a nucleic acidseparated or significantly free from at least some of the othercomponents of the naturally occurring organism, for example, the cellstructural components commonly found associated with nucleic acids in acellular environment and/or other nucleic acids. The isolation of thenative nucleic acids can be accomplished, for example, by techniquessuch as cell lysis followed by phenol plus chloroform extraction,followed by ethanol precipitation of the nucleic acids. The nucleicacids of this invention can be isolated from cells according to any ofmany methods well known in the art.

[0027] An isolated nucleic acid comprising a unique fragment of at least10 nucleotides of the nucleic acid set forth in the Sequence Listing asany of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID: 6, SEQ ID: 8,SEQ ID: 9 or SEQ ID NO: 10 is also provided. Unique fragments, as usedherein means a nucleic acid of at least 10 nucleotides that is notidentical to any other known nucleic acid sequence. Examples of thesequences of at least 10 nucleotides that are unique to clk2, propin1and cote1 can be readily ascertained by comparing the sequence of thenucleic acid in question to sequences catalogued in GenBank, or anyother sequence database, using the computer programs such as DNASIS(Hitachi Engineering, Inc.) or Word Search or FASTA of the GeneticsComputer Group (GCG) (Madison, Wis.), which search the cataloguednucleotide sequences for similarities to the nucleic acid in question.If the sequence does not match any of the known sequences, it is unique.For example, the sequence of nucleotides 1-10 can be used to search thedatabases for an identical match. If no matches are found, thennucleotides 1-10 represent a unique fragment. Next, the sequence ofnucleotides 2-11 can be used to search the databases, then the sequenceof nucleotides 3-13, and so on up to nucleotides 75260 to 75270 of thesequence set forth in the Sequence Listing as SEQ ID NO: 1. The sametype of search can be performed for sequences of 11 nucleotides, 12nucleotides, 13 nucleotides, etc. The possible fragments range from 10nucleotides in length to 1 nucleotides less that the sequence set forthin the Sequence Listing as SEQ ID NO: 1. These unique nucleic acids, aswell as degenerate nucleic acids can be used, for example, as primersfor amplifying nucleic acids in order to isolate allelic variants of theclk2, propin1 or cote1 proteins or as primers for reverse transcriptionof clk2, propin1 or cote1 mRNA, or as probes for use in detectiontechniques such as nucleic acid hybridization. One skilled in the artwill appreciate that even though a nucleic acid of at least 10nucleotides is unique to a specific gene, that nucleic acid fragment canstill hybridize to many other nucleic acids and therefore be used intechniques such as amplification and nucleic acid detection.

[0028] Also provided are allelic variants of the clk2, propin1 and cote1proteins set forth in the Sequence Listing as any of SEQ ID NO: 3, SEQID NO: 5, and SEQ ID NO: 7. As used herein, the term “allelicvariations” or “allelic variants” is used to describe the same, orsimilar proteins that are diverged from the clk2, propin1, or cote1proteins set forth in SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 byless than 20% in their corresponding amino acid identity. In anotherembodiment, these allelic variants are less than 18% divergent in theircorresponding amino acid identity. In another embodiment, these allelicvariants are less than 15% divergent in their corresponding amino acididentity. In another embodiment, these allelic variants are less than12% divergent in their corresponding amino acid identity. In anotherembodiment, these allelic variants are less than 10% divergent in theircorresponding amino acid identity. In another embodiment, these allelicvariants are less than 7% divergent in their corresponding amino acididentity. In another embodiment, these allelic variants are less than 5%divergent in their corresponding amino acid identity. In anotherembodiment, these allelic variants are less than 3% divergent in theircorresponding amino acid identity. In another embodiment, these allelicvariants are less than 2% divergent in their corresponding amino acididentity. In yet another embodiment, these allelic variants are lessthan 1% divergent in their corresponding amino acid identity. Theseamino acids can be substitutions within the amino acid sequence setforth in the Sequence Listing as any of SEQ ID NO: 3, SEQ ID NO: 5, andSEQ ID NO: 7, they can be deletions from the amino acid sequence setforth in the Sequence Listing as SEQ ID NO: 3, SEQ ID NO: 5, and SEQ IDNO: 7 and they can be additions to the amino acid sequence set forth inthe Sequence Listing as SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7, orany combinations thereof.

[0029] The homology between the protein coding regions of the nucleicacids encoding the allelic variants of the clk2, propin1 and the cote1proteins is preferably less than 20% divergent from the region of thenucleic acid set forth in the Sequence Listing as SEQ ID NO: 2, SEQ IDNO: 4 and SEQ ID NO: 6, respectively, encoding the proteins. In anotherembodiment, the corresponding nucleic acids are less than 18% divergentin their sequence identity. In another embodiment, the correspondingnucleic acids are less than 15% divergent in their sequence identity. Inanother embodiment, the corresponding nucleic acids are less than 12%divergent in their sequence identity. In another embodiment, thecorresponding nucleic acids are less than 10% divergent in theirsequence identity. In another embodiment, corresponding nucleic acidsare less than 7% divergent in their sequence identity. In anotherembodiment, the corresponding nucleic acids are less than 5% divergentin their sequence identity. In another embodiment, the correspondingnucleic acids are less than 3% divergent in their sequence identity. Inanother embodiment, the corresponding nucleic acids are less than 2%divergent in their corresponding sequence identity. In yet anotherembodiment, the corresponding nucleic acids are less than 1% divergentin their sequence identity

[0030] One skilled in the art will appreciate that nucleic acidsencoding homologs or allelic variants of the proteins set forth in theSequence Listing as SEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7 can beisolated in a manner similar to that used to isolate the nucleic acidsset forth in the Sequence Listing of the present invention as SEQ ID NO:1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ D NO: 6, SEQ ID NO: 8, SEQ ID NO: 9,and SEQ ID NO: 10. For example, given the sequence of the primers usedto amplify the nucleic acid set forth in the sequence listing as SEQ IDNO: 1, one can use these or similar primers to amplify a homologous genefrom other sources.

[0031] For example, SEQ ID NO: 9 and SEQ ID NO: 10 represent unsplicedmRNA for the propin1 and cote1 genes, respectively. Using the sequenceinformation provided herein, one skilled in the art can obtain thesequence of a primer that will hybridize to the desired message ofgenomic DNA, either in an intron, in an exon, or both, such thatcorresponding RNAs or DNAs from other organisms can readily be detected,and/or isolated. An example of a similar cross-species detection isprovided in the Example herein where cDNA molecules were hybridized togenomic DNA from various vertebrates and budding yeast to detectanalogous genes in those organisms. One can therefore use a spliced oran unspliced message, or a corresponding DNA, or a fragment thereof, asa probe to detect homologous sequences in other organisms, or similargenes from other individuals from the same species.

[0032] The present invention also contemplates any unique fragment ofthese genes, such as those encoding domains of the proteins set forth inthe Sequence Listing as SEQ ID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7 orof the nucleic acids set forth in any of SEQ ID NO: 1, SEQ ID NO: 2, SEQID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10.To be unique, the fragment must be of sufficient size to distinguish itfrom other known sequences, most readily determined by comparing anynucleic acid fragment to the nucleotide sequences in computer databases,such as GenBank. Such comparative searches are standard in the art.Typically, a unique fragment usefull as a primer or probe will be atleast 20 to about 25 nucleotides in length depending upon the specificnucleotide content of the sequence. Additionally, fragments can be, forexample, at least about 30, 40, 50, 75, 100, 200 or 500 nucleotides inlength All of the genes, nucleic acids, and fragments of the genes andnucleic acids disclosed and contemplated herein can be single ormultiple stranded, depending on the purpose for which it is intended.

[0033] Once a nucleic acid encoding a particular protein of interest, ora region of that nucleic acid, is constructed, modified, or isolated,that nucleic acid can then be cloned into an appropriate vector, whichcan direct the in vivo or in vitro synthesis of that wild-type and/ormodified protein. The vector is contemplated to have the necessaryfunctional elements that direct and regulate transcription of theinserted gene, or nucleic acid. These functional elements include, butare not limited to, a promoter, regions upstream or downstream of thepromoter, such as enhancers that may regulate the transcriptionalactivity of the promoter, an origin of replication, appropriaterestriction sites to facilitate cloning of inserts adjacent to thepromoter, antibiotic resistance genes or other markers which can serveto select for cells containing the vector or the vector containing theinsert, RNA splice junctions, a transcription termination region, or anyother region which may serve to facilitate the expression of theinserted gene or hybrid gene. (See generally, Sambrook et al.).

[0034] There are numerous E. coli (Escherichia coli) expression vectorsknown to one of ordinary skill in the art which are useful for theexpression of the nucleic acid insert. Other microbial hosts suitablefor use include bacilli, such as Bacillus subtilis, and otherenterobacteriaceae, such as Salmonella, Serratia, and variousPseudomonas species. In these prokaryotic hosts one can also makeexpression vectors, which will typically contain expression controlsequences compatible with the host cell (e.g., an origin ofreplication). In addition, any number of a variety of well-knownpromoters will be present, such as the lactose promoter system, atryptophan (Trp) promoter system, a beta-lactamase promoter system, or apromoter system from phage lambda. The promoters will typically controlexpression, optionally with an operator sequence, and have ribosomebinding site sequences for example, for initiating and completingtranscription and translation If necessary, an amino terminal methioninecan be provided by insertion of a Met codon 5′ and in-frame with thedownstream nucleic acid insert. Also, the carboxy-terminal extension ofthe nucleic acid insert can be removed using standard oligonucleotidemutagenesis procedures.

[0035] Additionally, yeast expression can be used. There are severaladvantages to yeast expression systems. First, evidence exists thatproteins produced in a yeast secretion systems exhibit correct disulfidepairing. Second, post-translational glycosylation is efficiently carriedout by yeast secretory systems. The Saccharomyces cerevisiaepre-pro-alpha-factor leader region (encoded by the MF″-1 gene) isroutinely used to direct protein secretion from yeast. (Brake, et al.“∝-Factor-Directed Synthesis and Secretion of Mature Foreign Proteins inSaccharomyces cerevisiae.” Proc. Nat. Acad. Sci., 81:4642-4646 (1984)).The leader region of pre-pro-alpha-factor contains a signal peptide anda pro-segment which includes a recognition sequence for a yeast proteaseencoded by the KEX2 gene: this enzyme cleaves the precursor protein onthe carboxyl side of a Lys-Arg dipeptide cleavage signal sequence. Thenucleic acid coding sequence can be fused in-frame to thepre-pro-alpha-factor leader region. This construct is then put under thecontrol of a strong transcription promoter, such as the alcoholdehydrogenase I promoter or a glycolytic promoter. The nucleic acidcoding sequence is followed by a translation termination codon which isfollowed by transcription termination signals. Alternatively, thenucleic acid coding sequences can be fused to a second protein codingsequence, such as Sj26 or -galactosidase, used to facilitatepurification of the fusion protein by affinity chromatography. Theinsertion of protease cleavage sites to separate the components of thefusion protein is applicable to constructs used for expression in yeast.Efficient post translational glycosylation and expression of recombinantproteins can also be achieved in Baculovirus systems.

[0036] Mammalian cells permit the expression of proteins in anenvironment that favors important post-translational modifications suchas folding and cysteine pairing, addition of complex carbohydratestructures, and secretion of active protein Vectors useful for theexpression of active proteins in mammalian cells are characterized byinsertion of the protein coding sequence between a strong viral promoterand a polyadenylation signal. The vectors can contain genes conferringhygromycin resistance, gentamicin resistance, or other genes orphenotypes suitable for use as selectable markers, or methotrexateresistance for gene amplification. The chimeric protein coding sequencecan be introduced into a Chinese hamster ovary (CHO) cell line using amethotrexate resistance-encoding vector, or other cell lines usingsuitable selection markers. Presence of the vector DNA in transformedcells can be confirmed by Southern blot analysis. Production of RNAcorresponding to the insert coding sequence can be confirmed by Northernblot analysis. A number of other suitable host cell lines capable ofsecreting intact human proteins have been developed in the art, andinclude the CHO cell lines, HeLa cells, mycloma cell lines, Jurkatcells, etc. Expression vectors for these cells can include expressioncontrol sequences, such as an origin of replication, a promoter, anenhancer, and necessary information processing sites, such as ribosomebinding sites, RNA splice sites, polyadenylation sites, andtranscriptional terminator sequences. Preferred expression controlsequences are promoters derived from immunoglobulin genes, SV40,Adenovirus, Bovine Papilloma Virus, etc. The vectors containing thenucleic acid segments of interest can be transferred into the host cellby well-known methods, which vary depending on the type of cellularhost. For example, calcium chloride transformation is commonly utilizedfor prokaryotic cells, whereas calcium phosphate, DEAE dextran, orlipofectin mediated transfection or electroporation may be used forother cellular hosts.

[0037] Alternative vectors for the expression of genes or nucleic acidsin mammalian cells, those similar to those developed for the expressionof human gamma-interferon, tissue plasminogen activator, clotting FactorVIII, hepatitis B virus surface antigen, protease Nexin1, and eosinophilmajor basic protein, can be employed. Further, the vector can includeCMV promoter sequences and a polyadenylation signal available forexpression of inserted nucleic acids in mammalian cells (such as COS-7).

[0038] Insect cells also permit the expression of mammalian proteins.Recombinant proteins produced in insect cells with baculovirus vectorsundergo post-translational modifications similar to that of wild-typeproteins. Briefly, baculovirus vectors useful for the expression ofactive proteins in insect cells are characterized by insertion of theprotein coding sequence downstream of the Autographica californicanuclear polyhedrosis virus (AcNPV) promoter for the gene encodingpolyhedrin, the major occlusion protein. Cultured insect cells such asSpodoptera frugiperda cell lines are transfected with a mixture of viraland plasmid DNAs and the viral progeny are plated. Deletion orinsertional inactivation of the polyhedrin gene results in theproduction of occlusion negative viruses which form plaques that aredistinctively different from those of wild-type occlusion positiveviruses. These distinctive plaque morphologies allow visual screeningfor recombinant viruses in which the AcNPV gene has been replaced with ahybrid gene of choice.

[0039] Alternatively, the genes or nucleic acids of the presentinvention can be operatively linked to one or more of the functionalelements that direct and regulate transcription of the inserted gene asdiscussed above and the gene or nucleic acid can be expressed. Forexample, a gene or nucleic acid can be operatively linked to a bacterialor phage promoter and used to direct the transcription of the gene ornucleic acid in vitro. A further example includes using a gene ornucleic acid provided herein in a coupled transcription-translationsystem where the gene directs transcription and the RNA thereby producedis used as a template for translation to produce a polypeptide. Oneskilled in the art will appreciate that the products of these reactionscan be used in many applications such as using labeled RNAs as probesand using polypeptides to generate antibodies or in a procedure wherethe polypeptides are being administered to a cell or a subject.

[0040] Expression of the gene or nucleic acid, either in combinationwith a vector or operatively linked to an appropriate sequence, can beby either in vivo or in vitro. In vivo synthesis comprises transformingprokaryotic or eukaryotic cells that can serve as host cells for thevector. Alternatively, expression of the gene or nucleic acid can occurin an in vitro expression system. For example, in vitro transcriptionsystems are commercially available which are routinely used tosynthesize relatively large amounts of mRNA. In such in vitrotranscription systems, the nucleic acid encoding the desired gene wouldbe cloned into an expression vector adjacent to a transcriptionpromoter. For example, the Bluescript II cloning and expression vectorscontain multiple cloning sites which are flanked by strong prokaryotictranscription promoters. (Stratagene Cloning Systems, La Jolla, Calif.).Kits are available which contain all the necessary reagents for in vitrosynthesis of an RNA from a DNA template such as the Bluescript vectors.(Stratagene Cloning Systems, La Jolla, Calif.). RNA produced in vitro bya system such as this can then be translated in vitro to produce thedesired protein. (Stratagene Cloning Systems, La Jolla, Calif.).

[0041] If ex vivo methods are employed, cells or tissues can be removedand maintained outside the body according to standard protocols wellknown in the art. The nucleic acids of this invention can be introducedinto the cells via any gene transfer mechanism, such as, for example,virus-mediated gene delivery, calcium phosphate mediated gene delivery,electroporation, microinjection or proteoliposomes. The transduced cellscan then be infused (e.g., in a pharmaceutically acceptable carrier) orhomotopically transplanted back into a subject per standard methods forthe cell or tissue type. Standard methods are known for transplantationor infusion of various cells into a subject

[0042] The nucleic acids of this invention can also be utilized for invivo gene therapy techniques. With regard to gene therapy applications,the nucleic acid can comprise a nucleotide sequence which encodes a geneproduct which is meant to function in the place of a defective geneproduct and restore normal function to a cell which functionedabnormally due to the defective gene product. Alternatively, the nucleicacid may encode a gene product which was not previously present in acell or was not previously present in the cell at a therapeuticconcentration, whereby the presence of the exogenous gene product orincreased concentration of the exogenous gene product imparts atherapeutic benefit to the cell and/or to a subject. For example, thenucleic acid of this invention can include but is not limited to, a geneencoding a gene product involved in Gaucher's disease.

[0043] For in vivo administration, the cells can be in a subject and thenucleic acid can be administered in a pharmaceutically acceptablecarrier. The subject can be any animal in which it is desirable toselectively express a nucleic acid in a cell. In a preferred embodiment,the animal of the present invention is a human. In addition, non-humananimals which can be treated by the method of this invention caninclude, but are not limited to, cats, dogs, birds, horses, cows, goats,sheep, guinea pigs, hamsters, gerbils and rabbits, as well as any otheranimal in which selective expression of a nucleic acid in a cell can becarried out according to the methods described herein.

[0044] In the method described above which includes the introduction ofexogenous DNA into the cells of a subject (i.e., gene transduction ortransfection), the nucleic acids of the present invention can be in theform of naked DNA or the nucleic acids can be in a vector for deliveringthe nucleic acids to the cells for expression of the nucleic acid insidethe cell. The vector can be a commercially available preparation, suchas an adenovirus vector (Quantum Biotechnologies, Inc. (Laval, Quebec,Canada). Delivery of the nucleic acid or vector to cells can be via avariety of mechanisms. As one example, delivery can be via a liposome,using commercially available liposome preparations such as Lipofectin®,Lipofectamine® (GIBCO-BRL, Inc., Gaithersburg, Md.), Superfect® (Qiagen,Inc. Hilden, Germany) and Transfectam® (Promega Biotec, Inc., Madison,Wis.), as well as other liposomes developed according to proceduresstandard in the art. In addition, the nucleic acid or vector of thisinvention can be delivered in vivo by electroporation, the technologyfor which is available from Genetronics, Inc. (San Diego, Calif.) aswell as by means of a Sonoporation machine (ImaRx Pharmaceutical Corp.,Tucson, Ariz.).

[0045] As one example, vector delivery can be via a viral system, suchas a retroviral vector system which can package a recombinant retroviralgenome. The recombinant retrovirus can then be used to infect andthereby deliver nucleic acid to the infected cells. The exact method ofintroducing the nucleic acid into mammalian cells is, of course, notlimited to the use of retroviral vectors. Other techniques are widelyavailable for this procedure including the use of adenoviral vectors,adeno-associated viral (AAV) vectors, lentiviral vectors, pseudotypedretroviral vectors, and pox virus vectors, such as vaccinia virusvectors. Physical transduction techniques can also be used, such asliposome delivery and receptor-mediated and other endocytosis mechanism.This invention can be used in conjunction with any of these or othercommonly used gene transfer methods.

[0046] The nucleic acid and the nucleic acid delivery vehicles of thisinvention, (e.g., viruses; liposomes, plasmids, vectors) can be in apharmaceutically acceptable carrier for in vivo administration to asubject By “pharmaceutically acceptable” is meant a material that is notbiologically or otherwise undesirable, i.e., the material may beadministered to a subject, along with the nucleic acid or vehicle,without causing any undesirable biological effects or interacting in adeleterious manner with any of the other components of thepharmaceutical composition in which it is contained. The carrier wouldnaturally be selected to minimize any degradation of the activeingredient and to minimize any adverse side effects in the subject, aswould be well known to one of skill in the art.

[0047] The nucleic acid or vehicle may be administered orally,parenterally (e.g., intravenously), by intramuscular injection, byintraperitoneal injection, transdermally, extracorporeally, topically orthe like. The exact amount of the nucleic acid or vector required willvary from subject to subject, depending on the species, age, weight andgeneral condition of the subject, the severity or mechanism of anydisorder being treated, the particular nucleic acid or vehicle used, itsmode of administration and the like.

[0048] Also provided by the present invention is an isolateddouble-stranded nucleic acid, consisting of 1) a single-stranded DNAwhich has an apparent molecular size of 6.4 Kb and is derived fromhumans, and 2) a DNA complementary to the single-stranded DNA, givingthe same restriction map as that obtained with the nucleic acid setforth in SEQ ID NO: 8, such as that of the corresponding region of therestriction map of FIG. 1, which represents a genomic nucleic acidcorresponding to an unspliced RNA and therefore includes introns of ahuman gene encoding propin 1.

[0049] The term “apparent molecular size” as used herein means anestimated size obtained upon running the nucleic acid on a gel such asan agarose or polyacrylamide gel, or by other molecular size estimatestechniques, such as molecular weight filtration. Briefly, the nucleicacid sample is applied to the gel and electrophoresed simultaneouslywith nucleic acid markers of known molecular size. Upon completion ofgel electrophoresis, the size of the nucleic acid sample is estimated bycomparing its mobility with the relative mobilities of the nucleic acidmarkers.

[0050] The invention further provides for a single-stranded RNAcorresponding to the single-stranded DNA of the double-stranded nucleicacid set forth in SEQ ID NO: 8, and a single-stranded RNA correspondingto the complementary DNA of the double-stranded nucleic acid set forthin SEQ ID NO: 8.

[0051] The invention also provides an isolated double-stranded nucleicacid consisting of 1) a single-stranded DNA which has an apparentmolecular size of 8.3 Kb and is derived from humans, and 2) a DNAcomplementary to the single-stranded DNA, giving the same restrictionmap as that obtained with the nucleic acid of SEQ ID NO: 9, such as thatof the corresponding region of the restriction map of FIG. 1, whichrepresents a genomic nucleic acid corresponding to an unspliced RNA andtherefore includes introns of a human gene encoding cote 1.

[0052] The invention further provides for a single-stranded RNAcorresponding to the single-stranded DNA of the double-stranded nucleicacid set forth in SEQ ID NO: 9, and a single-stranded RNA correspondingto the complementary DNA of the double-stranded nucleic acid set forthin SEQ ID NO: 9.

[0053] The invention further provides an isolated double-strandednucleic acid, consisting of 1) a single-stranded DNA which has anapparent molecular size of 1.04 Kb and is derived from humans, and 2) aDNA complementary to the single-stranded DNA, which is a double-strandednucleic acid corresponding to a cDNA of the genomic nucleic acid andtherefore does not include any introns of a human gene encoding propin1.

[0054] The invention also provides a single-stranded RNA correspondingto the single-stranded DNA of the double-stranded nucleic acid set forthin SEQ ID NO: 4, and a single-stranded RNA corresponding to thecomplementary DNA of the double-stranded nucleic acid set forth in SEQID NO: 4.

[0055] The invention also relates to an isolated double-stranded nucleicacid consisting of 1) a single-stranded DNA which has an apparentmolecular size of 2.01 Kb and is derived from humans, and 2) a DNAcomplementary to the single-stranded DNA, which represents a cDNA of thegenomic nucleic acid sequence and therefore does not include any intronsof a human gene encoding cote 1.

[0056] Also provided by the present invention is a single-stranded RNAcorresponding to the single-stranded DNA of the double-stranded nucleicacid set forth in SEQ ID NO: 6, and a single-stranded RNA correspondingto the complementary DNA of the double-stranded nucleic acid set forthin SEQ ID NO: 6.

[0057] The invention also provides an isolated double-stranded nucleicacid consisting of 1) a single-stranded DNA which has an apparentmolecular size of 1.8 Kb and is derived from humans, and 2) a DNAcomplementary to the single-stranded DNA, and which is a variant of thenucleic acid encoding cote1.

[0058] Also provided by the present invention is a single-stranded RNAcorresponding to the single-stranded DNA which has an apparent molecularsize of 1.8 Kb and is derived from humans, and a single-stranded RNAcorresponding to the complementary DNA of the single-stranded DNA.

[0059] The invention also relates to an isolated double-stranded nucleicacid consisting of 1) a single-stranded DNA which has an apparentmolecular size of 6.0 Kb and is derived from humans, and 2) a DNAcomplementary to the single-stranded DNA, and which is a variant of thenucleic acid encoding cote 1.

[0060] The invention further provides a single-stranded RNAcorresponding to the single-stranded DNA which has an apparent molecularsize of 6.0 Kb and is derived from humans, and a single-stranded RNAcorresponding to the complementary DNA of the single-stranded DNA.

[0061] The invention also relates to an isolated double-stranded nucleicacid consisting of 1) a single-stranded DNA which has a molecular sizeof 3.0 to 3.4 Kb and is derived from humans, and 2) a DNA complementaryto the single-stranded DNA, and which is a variant of the nucleic acidencoding cote 1.

[0062] The invention further provides a single-stranded RNAcorresponding to the single-stranded DNA which has an apparent molecularsize of 3.0 to 3.4 Kb and is derived from humans, and a single-strandedRNA corresponding to the complementary DNA of the single-stranded DNA.

[0063] In another aspect, the invention provides a polypeptide encodedby the nucleic acid set forth in any of SEQ ID NO: 2, SEQ ID NO: 4, andSEQ ID NO: 6 and the polypeptides of any of SEQ ID NO: 3, SEQ ID NO: 5,and SEQ ID NO: 7, and the nucleic acids encoding the polypeptides of SEQID NO: 3, SEQ ID NO: 5, and SEQ ID NO: 7.

[0064] These polypeptides can also be obtained in any of a number ofprocedures well known in the art. One method of producing a polypeptideis to link two peptides or polypeptides together by protein chemistrytechniques. For example, peptides or polypeptides can be chemicallysynthesized using currently available laboratory equipment using eitherFmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert-butyloxycarbonoyl)chemistry. (Applied Biosystems, Inc., Foster City, Calif.). One skilledin the art can readily appreciate that a peptide or polypeptidecorresponding to a particular protein can be synthesized by standardchemical reactions. For example, a peptide or polypeptide can besynthesized and not cleaved from its synthesis resin whereas the otherfragment of a hybrid peptide can be synthesized and subsequently cleavedfrom the resin, thereby exposing a terminal group which is functionallyblocked on the other fragment. By peptide condensation reactions, thesetwo fragments can be covalently joined via a peptide bond at theircarboxyl and amino termini, respectively, to form a larger polypeptide.(Grant, “Synthetic Peptides: A User Guide,” W. H. Freeman and Co., N.Y.(1992) and Bodansky and Trost, Ed., “Principles of Peptide Synthesis,”Springer-Verlag Inc., N.Y. (1993)). Alternatively, the peptide orpolypeptide can be independently synthesized in vivo as described above.Once isolated, these independent peptides or polypeptides may be linkedto form a larger protein via similar peptide condensation reactions.

[0065] For example, enzymatic ligation of cloned or synthetic peptidesegments can allow relatively short peptide fragments to be joined toproduce larger peptide fragments, polypeptides or whole protein domains(Abrahmsen et al. Biochemistry, 30:4151 (1991)). Alternatively, nativechemical ligation of synthetic peptides can be utilized to syntheticallyconstruct large peptides or polypeptides from shorter peptide fragments.This method consists of a two step chemical reaction (Dawson et al.“Synthesis of Proteins by Native Chemical Ligation,” Science,266:776-779 (1994)). The first step is the chemoselective reaction of anunprotected synthetic peptide-∝-thioester with another unprotectedpeptide segment containing an amino-terminal Cys residue to give athioester-linked intermediate as the initial covalent product. Without achange in the reaction conditions, this intermediate undergoesspontaneous, rapid intramolecular reaction to form a native peptide bondat the ligation site. Application of this native chemical ligationmethod to the total synthesis of a protein molecule is illustrated bythe preparation of human interleukin 8 (IL-8) (Clark-Lewis et al. FEBSLett., 307:97 (1987), Clark-Lewis et al, J.Biol.Chem., 269:16075 (1994),Clark-Lewis et al. Biochemistry, 30:3128 (1991), and Rajarathnam et al.Biochemistry, 29:1689 (1994)).

[0066] Alternatively, unprotected peptide segments can be chemicallylinked where the bond formed between the peptide segments as a result ofthe chemical ligation is an unnatural (non-peptide) bond (Schnolzer etal. Science, 256:221(1992)). This technique has been used to synthesizeanalogs of protein domains as well as large amounts of relatively pureproteins with full biological activity (deLisle Milton et al.“Techniques in Protein Chemistry IV,” Academic Press, New York, pp.257-267 (1992)).

[0067] The present invention also contemplates DNA probes for detectingthe propin1 gene of the locus of SEQ ID NO: 1, wherein the DNA probehybridizes to the nucleotide sequence set forth in the Sequence Listingas any of SEQ ID NO: 4 and/or SEQ ID NO: 8, and DNA probes for detectingthe cote1 gene, wherein the DNA probe hybridizes to the nucleotidesequence set forth in the Sequence Listing as any of SEQ ID NO: 6 and/orSEQ ID NO: 9.

[0068] As used herein, the tern “DNA probe” refers to a nucleic acidfragment that selectively hybridizes under stringent conditions with anucleic acid comprising a nucleic acid set forth in a sequence listedherein. This hybridization must be specific. The degree ofcomplementarity between the hybridizing nucleic acid and the sequence towhich it hybridizes should be at least enough to exclude hybridizationwith a nucleic acid encoding an unrelated protein.

[0069] Allelic variants can be identified and isolated by nucleic acidhybridization techniques. Probes selective to the nucleic acid set forthin the Sequence Listing as any of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO:4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10 can besynthesized and used to probe the nucleic acid from various cells,tissues, libraries etc. High sequence complementarity and stringenthybridization conditions can be selected such that the probe selectivelyhybridizes to allelic variants of the sequence set forth in the SequenceListing as any of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO:6, SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10. For example, Theselectively hybridizing nucleic acids of the invention can have at least70%, 80%, 85%, 90%, 95%, 97%, 98% and 99% complementarity with thesegment of the sequence to which it hybridizes. The nucleic acids can beat least 12, 50, 100, 150, 200, 300, 500, 750, or 1000 nucleotides inlength Thus, the nucleic acid can be a coding sequence for the clk2,propin1 or cote1 proteins or fragments there of that can be used as aprobe or primer for detecting the presence of these genes. If used asprimers, the invention provides compositions including at least twonucleic acids which hybridize with different regions so as to amplify adesired region. Depending on the length of the probe or primer, targetregion can range between 70% complementary bases and fullcomplementarity and still hybridize under stringent conditions. Forexample, for the purpose of diagnosing the presence of an allelicvariant of the sequence set forth in the Sequence Listing as any of SEQID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ IDNO: 9 and SEQ ID NO: 10, the degree of complementarity between thehybridizing nucleic acid (probe or primer) and the sequence to which ithybridizes is at least enough to distinguish hybridization with anucleic acid from other bacteria. The invention provides examples ofnucleic acids unique to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 4, SEQ IDNO: 6, SEQ ID NO: 8, SEQ ID NO: 9 and SEQ ID NO: 10 in the SequenceListing so that the degree of complementarity required to distinguishselectively hybridizing from nonselectively hybridizing nucleic acidsunder stringent conditions can be clearly determined for each nucleicacid.

[0070] “Stringent conditions” refers to the washing conditions used in ahybridization protocol. In general, the washing conditions should be acombination of temperature and salt concentration chosen so that thedenaturation temperature is approximately 5-20° C. below the calculatedT_(m) of the nucleic acid hybrid under study. The temperature and saltconditions are readily determined empirically in preliminary experimentsin which samples of reference DNA immobilized on filters are hybridizedto the probe or protein coding nucleic acid of interest and then washedunder conditions of different stringencies. The T_(m) of such anoligonucleotide can be estimated by allowing 20° C. for each A or Tnucleotide, and 4° C. for each G or C. For example, an 18 nucleotideprobe of 50% G+C would, therefore, have an approximate T_(m) of 54° C.

[0071] Also provided herein are purified antibodies that selectivelybind to the polypeptides provided and contemplated herein, or purifiedantibodies which selectively bind to a polypeptide encoded by thenucleic acid set forth in any of SEQ ID NO: 2, SEQ ID NO: 4, and SEQ IDNO: 6, and purified antibodies which selectively bind to a polypeptideencoded by a nucleic acid encoding the polypeptide set forth in any ofSEQ ID NO: 3, SEQ ID NO: 5 and SEQ ID NO: 7. The antibody (eitherpolyclonal or monoclonal) can be raised to any of the polypeptidesprovided and contemplated herein, in its naturally occurring form and inits recombinant form. The antibody can be used in techniques orprocedures such as diagnostics, treatment, or vaccination. Antibodiescan be made by many well-known methods (See, e.g. Harlow and Lane,“Antibodies; A Laboratory Manual” Cold Spring Harbor Laboratory, ColdSpring Harbor, N.Y., (1988)). Briefly, purified antigen can be injectedinto an animal in an amount and in intervals sufficient to elicit animmune response. Antibodies can either be purified directly, or spleencells can be obtained from the animal. The cells can then fused with animmortal cell line and screened for antibody secretion. The antibodiescan be used to screen nucleic acid clone libraries for cells secretingthe antigen. Those positive clones can then be sequenced. (See, forexample, Kelly et al. Bio/Technology, 10:163-167 (1992); Bebbington etal. Bio/Technology, 10:169-175 (1992)).

[0072] The phrase “specifically binds” with the polypeptide refers to abinding reaction which is determinative of the presence of the proteinin a heterogeneous population of proteins and other biologics. Thus,under designated immunoassay conditions, the specified antibodies boundto a particular protein do not bind in a significant amount to otherproteins present in the sample. Selective binding to an antibody undersuch conditions may require an antibody that is selected for itsspecificity for a particular protein. A variety of immunoassay formatsmay be used to select antibodies that selectively bind with a particularprotein. For example, solid-phase ELISA immunoassays are routinely usedto select antibodies selectively immunoreactive with a protein. SeeHarlow and Lane “Antibodies, A Laboratory Manual” Cold Spring HarborPublications, New York, (1988), for a description of immunoassay formatsand conditions that could be used to determine selective binding.

[0073] This invention also contemplates producing a selected cell lineor a non-human transgenic animal model for the analysis of the functionof a gene comprising introducing into an embryonic stem cell a vectorhaving a selectable marker which, when the vector is inserted within agene, the inserted vector can inhibit the expression of the gene,selecting embryonic stem cells expressing the selectable marker,excising the vector from the embryonic stem cells expressing theselectable marker such that host DNA from the gene is linked to theexcised vector, sequencing the host DNA in the excised vector, comparingthe sequence of the host DNA to known gene sequences to determine whichhost DNA is from a gene for which a model for the analysis of thefunction the gene is desired, selecting the embryonic stem cellcontaining the inhibited gene for which a model for the analysis of genefunction is desired, and forming a cell line or a non-human transgenicanimal from the selected embryonic stem cell.

[0074] It is also contemplated in this invention that transgenic animalscan be produced which either overproduce the polypeptides of thisinvention or fail to produce the polypeptides of this invention in afunctional form. For example, a transgenic animal which overproduces thepropin1 of this invention can be produced according to methods wellknown in the art whereby nucleic acid encoding propin1 is introducedinto embryonic stem cells, at which stage it is incorporated into thegermline of the animal, resulting in the production of propin1 in thetransgenic animal in increased amounts relative to a normal animal ofthe same species. One skilled in the art can determine if overproductionor under production of propin1 results in altered GBA expression.

[0075] A transgenic animal in which the expression of propin1, forexample, is tissue specific is also contemplated for this invention. Forexample, transgenic animals that express or overexpress these genes atspecific sites such as the brain can be produced by introducing anucleic acid into the embryonic stem cells of the animal, wherein thenucleic acid is under the control of a specific promoter which allowsexpression of the nucleic acid in specific types of cells (e.g., aneuronal promoter which allows expression only in neuronal cells. Oneskilled in the art can determine if a tissue-specific alteration inclk2, propin1 or cote1 expression results in altered GBA expression byassaying for GBA expression in both the non-transgenic and thetransgenic animal.

[0076] Alternatively, the transgenic animal of this invention can be a“knock out” animal (see, e.g., Willnow et al., 1996), which can be ananimal that, for example, normally produces propin1 but has been alteredto prevent the expression of the animal's nucleic acid which encodespropin1, thereby resulting in an animal which does not produce propin1in a functional form. Such an animal may lack the ability to express allof the nucleic acids encoding propin1 or the transgenic animal may lackthe ability to express some (one or more than one) but not all of thenucleic acids encoding the propin1.

[0077] For example, the transgenic “knock out” animal of this inventioncan have the expression of a gene or genes knocked out in specifictissues. This approach obviates viability problems that can beencountered if the expression of a widely expressed gene is completelyabolished in all tissues. One skilled in the art could determine whetheror not the “knock out” has influenced the expression of GBA by assayingfor GBA in both the non-transgenic and transgenic animal. The knock-outmice can also be utilized to correlate particular genotypes withclinical presentation of Gaucher's disease.

[0078] The present invention is more particularly described in thefollowing examples which are intended as illustrative only sincenumerous modifications and variations therein will be apparent to thoseskilled in the art.

EXAMPLES

[0079] Isolation of Genomic Clones: The Center for Genetics in Medicine(CGM) YAC library (Burke and Olson 1991. “Preparation of clone librariesin yeast artificial-chromosome vectors.” In Methods in Enzymology. Guideto Yeast Genetics and Molecular Biology (eds. C. Guthrie, G. R Fink),pp. 251-270. Academic Press, San Diego, Calif.) was screened by PCRamplification of ordered arrays of pooled clones (Green and Olson 1990.“Systematic screening of yeast artificial-chromosome libraries by use ofthe polymerase chain reaction.” Proc. Natl. Acad. Sci.USA 87:1213-1217.) using a primer pair specific for the human GBA gene. A YACclone, CGY2981, contained both GBA and psGBA. A his3 mutation wasintroduced into the yeast from strain MB11, and the his5 mutationremoved by standard mating procedures (Sherman 1991. “Getting startedwith yeast.” In Methods in Enzymology. Guide to Yeast Genetics andMolecular Biology (eds. C. Guthrie, G. R. Fink), pp. 3-21. AcademicPress, San Diego, Calif.) The resulting strain, MNG1, was transformedwith fragmentation vectors pBP108 and pBP109 (Pavan et al. 1991.“High-efficiency yeast artificial chromosome fragmentation vectors.”Gene 106:125-127.) to reduce the insert size. One of the derivatives,MNG4, with an insert of ˜85 kb, was confirmed to contain both GBA andpsGBA and following partial Sau3AI digestion, was subcloned to Lambda,Dash (Stratagene, La Jolla, Ga.). The full-length YAC clone, MNG1,following partial SauA1 digestion, was also subcloned into SuperCos(Stratagene).

[0080] Several subclones that hybridized to a human 1.6-kb GBA cDNA weresubcloned into pGEM7zf+ (Promega, Madison, Wis.) for sequencing. Plasmidsubclones were ordered using sequence across the junction points in theDNA. A cosmid subclone of the YAC containing 16 kb of DNA upstream ofGBA was also subcloned to pGEM7zf+ and pGEM11zf+ for sequencing.

[0081] Complementary DNA Clones: A human hippocampal cDNA (Lambda ZapII)was obtained from Stratagene. The human brain plasmid library in apCMV-SPORT was from Life Technologies (Gaithersburg, Md.).

[0082] RACE PCR was used to determine the 5′ ends of cDNAs. PCRamplification was carried out using human brain cDNA ligated to ananchor (Clontech) or on cDNA isolated from the pSPORT human brainlibrary with anchor or vector primers and nested gene-specific primers.

[0083] DNA Sequence Analysis: Plasmid and cosmid DNA were prepared byeither standard alkaline lysis (Maniatis et al. 1982. “Molecularcloning: A laboratory manual.” Cold Spring Harbor Laboratory, ColdSpring Harbor, N.Y.), Perfect Prep kits (5 Prime →3 Prime, Boulder,Co.), or in some cases by CsCl gradient purification. Sequencing wasperformed on alkaline-denatured, double-stranded template using theSequenase Reagent kit (U.S. Biochemical, Inc., Cleveland, Ohio) or onsubclones using the Circumvent ThermoCycle Dideoxy DNA Sequencing kit(New England Biolabs, Beverly, Mass.) according to manufacturer'sprotocols, PCR products, cosmids, and subclones were sequenced using aPerkin-Elmer Applied Biosystems model 373A automated sequencer with theFS enzyme system and dye terminator chemistry according to themanufacturer's protocols (Perkin Elmer, Foster City, Calif.).Oligonucleotides were synthesized on an Expedite (Perseptive Biosystems,Framingham, Mass.) or Cyclone synthesizer (Milli. Bioresearch, Milford,Mass.) and used without purification. All sequence was determined inboth directions, entered into PC-Gene and analyzed by GRAIL (Ubersbacherand Mural 1991. “locating protein-coding regions in human DNA sequencesby a multiple sensor-neural network approach.” Proc. Natl. Acad. Sci.88: 11261-11265.) to determine potential coding areas. A BLAST-N(Altschul et al. 1990. “Basic local alignment search tool.” J. Mol.Biol. 215: 403-410.) comparison of putative exon sequences with theGenBank database was used to identify homologies. The BLAST-X (Altschulet al. 1990) program was also used to compare the predicted amino acidsequence to the protein database.

[0084] Northern and Zooblot Hybridization: Multiple human tissueNorthern blots and multiple species (Zoo) blots were obtained fromClontech Laboratories. Hybridization was carried out according to themanufacturer's protocol using cDNA from clk2 kinase, propin1, cote1, and-actin as probes.

[0085] The hybridization conditions used in the Northern and Zooblothybridization experiments described herein can be used to detect nucleicacid species from other organisms that correspond to the clk2, propin1,and cote1 genes of the present invention. For example, a representativehybridization experiment would consist of prehybridizing the filter(s)for 1 to 4 hours at 42° C. in a prehybridization mix (50% v/v formamide,5×SSC, 5×Denhardts, 20 mM Na phosphate pH 6.5, 1% v/v glycine, 100 μg/mlE.coli RNA and 10 μg/ml poly A RNA in sterile water). Followingprehybridization, E.coli DNA is added to the probe using 100 μl per 10ml of hybridization mix used. The probe/E.coli DNA is boiled for 5minutes, then quick cooled on ice for 3-5 minutes. The probe/E.coli DNAis added to a hybridization mix (50% formamide, 5×SSC, 5×Denhardt's, 20mM Na phosphate pH 6.5, 10% dextran sulfate and 10 μg/ml poly A RNA) andmixed gently. The filter is then hybridized overnight at 42° C.Subsequent to hybridization, the filters are washed 2×10 minutes at roomtemperature with 6×SSC, washed 4×5 minutes at room temperature with2×SSC, 0.1% SDS, washed 2×15 minutes at room temperature with .01×SSC,0.1% SDS, and washed 1×20 minutes at 55° C. with 0.1×SSC, 0.1% SDS.

[0086] Detailed Information Regarding the Glucocerebrosidase Gene Locus

[0087] The following information gives the GENBANK accession informationregarding the sequence of the present invention, which includes thecoordinates for the splice junctions of the unspliced RNAs and thecoordinates for the start and the end of the RNAs of the genes withinthis locus: LOCUS AF023268 75270 bp DNA PRI 28-OCT-1997 DEFINITION Homosapiens clk2 kinase (CLK2), propin1, cote1, glucocerebrosidase (GBA),and metaxin genes, complete cds; metaxin pseudogene andglucocerebrosidase pseudogene; and thrombospondin3 (THBS3) gene, partialcds. ACCESSION AF023268 NID   g2564910 KEYWORDS SOURCE  human. ORGANISMHomo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia;Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to75270) AUTHORS Winfield, S. L., Tayebi, N., Martin, B. M., Ginns, E. I.and Sidransky, E. TITLE Identification of three additional genescontiguous to the glucocerebrosidase locus on chromosome 1q21:implications for Gaucher disease JOURNAL Genome Res. 7 (10), 1020-1026(1997) MEDLINE 97474796 REFERENCE 2 (bases 1 to 75270) AUTHORS Winfield,S. L., Tayebi, N., Martin, B. M., Ginns, E. I. and Sidransky, E.TITLE  Direct Submission JOURNAL Submitted (05-SEP-1997) ClinicalNeuroscience Branch, NIMH, 9000 Rockville Pike Bldg. 49 Rm. B1EE16,Bethesda, MD 20892, USA FEATURES    Location/Qualifiers source  1..75270/organism=“Homo sapiens” /db_xref =“taxon:9606” /chromosome=“1”/map=“1q21”mRNA    join(1..129,2354..2523,3624..3852,4550..4637,4987..5053,5220..5336,6458..6624,7401..7495,8580..8709,8804..8886,9055..9134,9315..9405,9956..10482) /gene=“CLK2” gene    1..10482/gene=“CLK2” CDS    join(2354..2523,3624..3852,4550..4637,4987..5053,5220..5336,6458..6624,7401..7495,8580..8709,8804..8886,9055..9134,9315..9405,9956..10138) /gene=“CLK2” /codon_start=1/product=“clk2 kinase” /db_xref=“PID:g2564911”/translation=“MPHPRRYHSSERGSRGSYREHYRSRKHKRRRSRSWSSSSDRTRRRRREDSYHVRSRSSYDDRSSDRRVYDRRYCGSYRRNDYSRDRGDAYYDTDYRHSYEYQRENSSYRSQRSSRRKHRRRRRRSRTFSRSSSQHSSRRAKSVEDDAEGHLIYHVGDWLQERYEIVSTLGEGTFGRVVQCVDHRRGGARVALKIIKNVEKYKEAARLEINVLEKINEKDPDNKNLCVQMFDWFDYHGHMCISFELLGLSTFDFLKDNNYLPYPIHQVRHMAFQLCQAVKFLHDNKLTHTDLKPENILFVNSDYELTYNLEKKRDERSVKSTAVRVVDFGSATFDHEHHSTIVSTRHYRAPEVILELGWSQPCDVWSIGCIIFEYYVGFTLFQTHDNREHLAMMERILGPIPSRMIRKTRKQKYFYRGRLDWDENTSAGRYVRENCKPLRRYLTSEAEEHHQLFDLIESMLEYEPAKRLTLGEALQHPFFARLRAEPPNKLWDSSRDISR”mRNA    join(10950..11272,11624..11701,12699..12821,12908..13028,14412..14540,15709..15868,15980..16081,16574..16691, 16947..17388)/product=“propin1”CDS    join(11207..11272,11624..11701,12699..12821,12908..13028,14412..14540,15709..15868,15980..16081,16574..16691, ATTORNEY DOCKET NO:14014.0296 16947..17093) /codon_start=1 /product=“propin1”/db_xref=“PID:g2564915”/translation=“MAQSRDGGNPFAEPSELDNPFQDPAVIQHRPSRQYATRDVYNPFETREPPPAYEPPAPAPLPPPSAPSLQPSRKLSPTEPKNYGSYSTQASAAAATAELLKKQEELNRKAEELDRRERELQHAALGGTATRQNNWPPLPSFCPVQPCFFQDISMEIPQEFQKTVSTMYYLWMCSTLALLLNFLACLASFCVETNNGAGFGLSILWVLLFTPCSFVCWYRPMYKAFRSDSSFNFFAFFFNFFDQDVLFVLQAIGIPGWGFSGWISALVVPKGNTAVSVLMLLVALLFTGIAVLGIVMLKRIHSLYRRTGASFQKAQQEFAAGVFSNPAVRTRAANAAAGAAENAFRAP”mRNA    join(17913..18715,18913..18969,19174..19284,19390..19509,19636..19743,21463..21638,21770..21869,22207..22276,22550..23095,24910..24978,25080..25333,25428..26165) /product=“cote1”CDS    join(18491..18715,18913..18969,19174..19284,19390..19509,19636..19743,21463..21638,21770..21869,22207..22276,22550.23095,24910..24978,25080..25333,25428..25601) /codon_start=1/product=“cote1” /db_xref=“PID:g2564916”/translation=“MMPSPSDSSRSLTSRPSTRGLTHLRLHRPWLQALLTLGLVQVLLGILVVTFSMVASSVTTTESIKRSCPSWAGFSLAFSGVVGIVSWKRPFTLVISFFSLLSVLCVMLSMAGSVLSCKNAQLARDFQQCSLEGKVCVCCPSVPLLRPCPESGQELKVAPNSTCDEARGALKNLLFSVCGLTICAAIICTLSAIVCCIQIFSLDLVHTQLAPERSVSGPLGPLGCTSPPPAPLLHTMLDLEEFVPPVPPPPYYPPEYTCSSETDAQSITYNGSMDSPVPLYPTDCPPSYEAVMGLRGDSQATLFDPQLHDGSCICERVASIVDVSMDSGSLVLSAIGDLPGGSSPSEDSCLLELQGSVRSVDYVLFRSIQRSRAGYCLSLDCGLRGPFEESPLPRRPPRAARSYSCSAPEAPPPLGAPTAARSCHRLEGWPPWVGPCFPELRRRVPRGGGRPAAAPPTRAPTRRFSDSSGSLTPPGHRPPHPASPPPLLLPRSHSDPGITTSSDTADFRDLYTKVLEEEAASVSSADTGLCSEACLFRLARCPSPKLLRARSAEKRRPVPTFQKVPLPSGPAPAHSLGDLKGSWPGRGLVTRFLQISRKAPDPSGTGAHGHKQVPRSLWGRPGRESLHLRSCGDLSSSSSLRRLLSGRRLERGTRPHSLSLNGGSRETGL”mRNA    join(32168..32299,32668..32755,33308..33499,33623..33769,34735..34868,35079..35251,35806..36043,36915..37139,37540..37703,38073..38189,38284..38931) /gene=“GBA”/product=“glucocerebrosidase” gene   32168..38931 /gene=“GBA”CDS    join(32273..32299,32668..32755,33308..33499,33623..33769,34735..34868,35079..35251,35806..36043,36915..37139,37540..37703,38073..38189,38284..38389) /gene=“GBA” /codon_start=1/product=“glucocerebrosidase” /db_xref=“PID:g2564914”/translation=“MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTDPEAAKYVHGLAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFLETISPGYSIHTYLWRRQ” misc_feature  complement(38948..42406)/note=“metaxin pseudogene” misc_feature  54895..59543/note=“glucocerebrosidase pseudogene”mRNA    complement(join(<59671..59885,59978..60132,60598..60674,60801..60983,61149..61241,62741..62820,62953..63022, 64036..>64116))/product=“metaxin”CDS    complement(join(59671..59885,59978..60132,60598..60674,60801..60983,61149..61241,62741..62820,62953..63022, 64036..64116))/codon_start=1 /product=“metaxin” /db_xref=“PID:g2564913”/translation=“MAAPMELFCWSGGWGLPSVDLDSLAVLTYARFTGAPLKVHKISNPWQSPSGTLPALRTSHGEVISVPHKIITHLRKEKYNADYDLSARQGADTLAFMSLLEEKLLPVLVHTFWIDTKNYVEVTRKWYAEAMPFPLNFFLPGRMQRQYMERLQLLTGEHRPEDEEELEKELYREARECLTLLSQRLGSQKFFFGDAPASLDAFVFSYLALLLQAKLPSGKLQVHLRGLHNLCAYCTHILSLYFPWDGAEVPPQRQTPAGPETEEEPYRRRNQILSVLAGLAAMVGYALLSGIVSIQRATPARAPGTRTLGMAEEDEEE”mRNA    join(<65492..65570,66961..67167,68050..68306,68409..68511,69827..69853,70061..70143,70229..70280,70404..70552,70963..71103,71330..71407,71795..71947,72161..72271,72361..72468,72754..72913,73251..73369,73508..73560,74757..74950,75138..>75270) /gene=“THBS3” /product=“thrombospondin3”gene    65492..>75270 /gene=“THBS3”CDS    join(65492..65570,66961..67167,68050..68306,68409..68511,69827..69853,70061..70143,70229..70280, 70404..70552,70963..71103,71330..71407,71795..71947,72161..72271,72361..72468,72754..72913,73251..73369,73508..73560,74757..74950,75138..>75270) /gene=“THBS3” /codon_start=1/product=“thrombospondin3” /db_xref=“PID:g2564912”/translation=“METQELRGALALLLLCFFTSASQDLQVIDLLTVGESRQMVAVAEKIRTALLTAGDIYLLSTFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGKVHAVNLQQAGLADGRTHTVLLRLRGPSRPSPALHLYVDCKLGDQHAGLPALAPIPPAEVDGLEIRTGQKAYLRMQGFVESMKIILGGSMARVGALSECPFQGDESIHSAVTNALHSILGEQTKALVTQLTLFNQILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPNPCFRGVDCMEVYEYPGYRCGPCPPGLQGNGTHCSDINECAHADPCFPGSSCINTMPGFHCEACPRGYKGTQVSGVGIDYARASKQVCNDIDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCLPARTCHSPAHSPCHIHAHCLFERNGAVSCQCNVGWAGNGNVCGTDTDIDGYPDQALPCMDNNKHCKQDNCLLTPNSGQEDADNDGVGDQCDDD ADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACDNCPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDSCPEMSNPTQTDADSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECDGDDDNDGIPDYVPPGPDNCRLVPNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAEVTLTDFRAYQTVVLDP”

[0088]

1 10 1 75270 DNA Homo sapien 1 tcccagggtc ccgggttggg ggggtggagcagcatttcgt cgccgcgggg gtgccgggac 60 tccggccgca gtgtcgccgc catcacggacttcctgtggg acaagcgcac gggcctcgcc 120 gccagaacgg tgagcgcgcg ggttggctggccgcgcgaaa agatggcgac cgcggggcgg 180 gcggggttag gcttgggttg ggggcaagggacgggggcga gatttggaga ggaggagagt 240 tgagacctac gaagcgacgg agtagggagatgagggggaa ggaggtcggc ctgcgttaga 300 tgtaccaaga agacctgtag gaagcttgggcggaaccaga ctagaaggga tgatagagta 360 aggatgggag tggttccaga gatgtgaaggagtggaatat caaagttgga ctcgggtcca 420 atcaaagctg gaggggagga atacatcaccgtgtcacttg gaacatgcag gaagaatttg 480 cctgtaggag ttctggattt tctgagtaagaaaggaaaaa gcaaaagatt tcaaacttga 540 gaaaggtagg ctagtgatgt tttgaggggacagtgctgtg aggccgtttg atggtagtat 600 gtggtgagat ggcatttatg tggctatgctgattaggata actatgctgg gaaaatcttg 660 tgcttgcagc ttggtatgct accggccccagttacaccaa gaatgcatca aataccaccc 720 tcctcaattt ggggaggttg ggaagaagtgtatacagaat ggtggtcaga catcggaatc 780 attaaacctt ttgcatacgc gattctcatacacccccacc aagacttatg tccctgggcc 840 tgtttcattt tattgtgcct tcctattctcatcttcctgg attgtgacag tgatttttct 900 cttttttcct tgaaggaaaa aaaggatttgtgaaccatcc cattccaatt ccattaactg 960 gaaggtggcc agctgtcagc taagtagggctgatattaat taaaaaccac ttactggcca 1020 ggcgcggtgg ctcacatctg taatcccagcactttgggag gccaaggcgg gcggatcacc 1080 tgaggttggg agttcgagac cagcctgaccaacatggaga aaccctgtct ctactaaaaa 1140 tacaaaatta gctgggcgtg gcggcgcatgcctgtaatcc cagctactcg ggaggctgag 1200 gcaggagaat cgcttgaacc ctgcaggcggaggttatggt gagccgagat tgcgccattg 1260 cactccagcc tgggcaacaa gagcaaaactccgttttcaa aaaaaaaaaa aaaaacacaa 1320 gaaacagttg ttatctacgt cttaagcacaagagtggagg ggccagatgg gatgctgact 1380 gtattttgtg gcctactaca ttaaaacctctcaagggaga gttggccaaa gatgtgagag 1440 atgattaagt taattcaagg ggcccgacccagatccctgg agctgatagt atttggcgtt 1500 tacctggtat atgtagaaac ctgggagactggattcctac cctcaggaca ctgatagtct 1560 acttgaggag atgagatgta ctcttaatgagtgtacaaca tttatcataa ctgtaaacca 1620 aagctttact ctattgactt gtgaggaactcagcatcaca cagatctata tcttagatcc 1680 tgaacgggga tacactgtgg cagtctccatcggtcgtcat acactttgct ctgtagtgaa 1740 ctgtgtccta gacctgaagg tcttcatggagctgtgtttg aagaaaataa ggatagaaca 1800 cttgaactgg ctgggtgcgg tggctcacgcctgtaatccc agcactttgg gaggccgagg 1860 caggaggatc acgagtcagg agtttgagaccatcctggcc aacatggtga aaccccgtct 1920 ctactaaaaa tataaaaatt agccaggagtggtggtgcgt gcctgtaatc ccagctacac 1980 aggaggctga ggcaggagag ttgcttgaacctgggaagtg gaggttgcag tgagccgaga 2040 ttgtgccact gcactccagc ctggccacacagcgagactc tgtctcaaga ataaaataag 2100 aacacttgaa ctgaggggtg ggtaatgagggatgggtttg ggggctttct agagtttcag 2160 ccagagttag taggtggctg tactccaataaggaggaatt gtaagtacag aggaagactg 2220 aggaagttgt gcctggtaaa aagaggcagaaataagaggg gaaagcagct tagcatcttt 2280 ggagcagcca gtccttggag gagatttctgaaaactcgaa actttcctat ttccgctttc 2340 ccttgccttc cagatgccgc atcctcgaaggtaccactcc tcagagcgag gcagccgggg 2400 gagttaccgt gaacactatc ggagccgaaagcataagcga cgaagaagtc gctcctggtc 2460 aagtagtagt gaccggacac gacggcgtcggcgagaggac agctaccatg tccgttctcg 2520 aaggtgaggc ctcaggaaga gatactcaaacacagggtca ctcgctcatt cctttgttca 2580 acaaatactt attggttata tgctaaaaccattgccttaa gtagctggct gaaggtaatg 2640 tcgggataga taaccacaga agccatttgggctgcagtgt taagtgttag aatacataat 2700 gaggaactga ggacgtaccc agtgcgtttggttggagtgg ggcaggatgt taattcagat 2760 aaaacattgc agaaagcaac atctgtatatgtctggagga caagtagagt caggtggggg 2820 aagtgtcttt gggcagaggg aaagcatgtgcaaaggactg ggagctacag gtaactaata 2880 agctaacaga taagacaaag ttagatctttgtgtttttaa ttttttaaaa attttaaaat 2940 atcatttatt tatttttgag acagagtttcactctgttgc ccaggctgga gtgcagtggc 3000 aagatctcag ctcactgcaa actctatggccccaggttca agttgattct tgtgcctcag 3060 cctcccaagt tacctgggac tacaggcatgcgccatcatg cctggctaat tttttttttt 3120 tttttttgta tttttagtag agacaggatttcaccatgtt ggcctcaaac tcctgacctc 3180 aagtgctcca cctgccttgg cctcccaaagtgctgggatt acaggcgtga gccactgtgc 3240 ccggcccagt tttaatattt tagtggtatgttcagtggaa agaatgggtt aaggatggtt 3300 tagagcaagg ggggatttca ttcaggaaagcttgagaaaa aaatggctct ttgtttcttt 3360 gtatcactgc agcgggaaaa aggccccaatagggagtggt tctgcagatg ccttttcaga 3420 ggttttggtg acatggcaga tgtgcttggttaacctcaac ttcttctggt cccttggtcc 3480 tctccagtga ctcaaacaag tgtctgagaacagtttttct tagcattcac tctttgtcct 3540 cactaaggca ggaccttgtc acagggtatggtagggaatg tgatctgaac agatacacct 3600 cattggtatt tgttatccaa cagcagttatgatgatcgtt cgtccgaccg gagggtgtat 3660 gaccggcgat actgtggcag ctacagacgcaacgattata gccgggatcg gggagatgcc 3720 tactatgaca cagactatcg gcattcctatgaatatcagc gggagaacag cagttaccgc 3780 agccagcgca gcagccggag gaagcacagacggcggagga ggcgcagccg gacatttagc 3840 cgctcatctt cggtgagtgc cagcccaggcccttcctctc cccactcttc tgcaggccct 3900 ctaggactct ggtaagtgag cagtatccttgttctcagct gaacattggg gcatgaacac 3960 tgaggtgggc actgagtttg cctactttcttggaagctct ccgactcttg aagggccctg 4020 gatctgcttt ggagatggat gggcacggagcatttgtgac ccccagtgct ctccctggca 4080 tgttgggctt attgtgttgg gagcagcttctccgccccag cggcctccac tctttaatgg 4140 ggaccttgct tgttgaactg cctttttcccccaagccctg ggctctgtag cccccttggc 4200 aggcgggctt gggtgggggc aaggggagatctgtgtctgc ccggaagggc attgtgtaga 4260 gcatggggtt gtgggtgaca ttggcaacaacaccacttct ctgctctgcc catgcctctc 4320 cctgttcctg ttttgggttt ggggacttgggattatgcgc cgctctctct tctcacaccg 4380 atccacctta ctgcatctga cgtgttcccttccactgtcc cccatcatcg tctgtccccc 4440 ctggctgggc gcctgtgacc ggtgacccctctaccaccca ccccgcctcc ctccggcccc 4500 cggctccgaa cactgggctt ggttcggaaccccccctgcc ccccgacagc agcacagcag 4560 ccggagagcc aagagtgtag aggacgacgctgagggccac ctcatctacc acgtcgggga 4620 ctggctacaa gagcgatgta caagccaaatcgtaacaatc ctatagcctg taatggtccc 4680 atagccatcc taacgtccca agccaacctgagtcacagcc tcttgccttt tgctcagtgg 4740 tgttgtttat ctgggtggtt tgagttgctgttgagaattg acctctgttg tccactccca 4800 gctctcacgg cccctgggtt aaaggtggtgggaatcacgc aggggttttc ttcccctgtg 4860 accatctgta tctgttcccc ttccttcatctccaccccag gttgctgtcc ccttttttct 4920 tccaactcag ctcattcccc accttctctccctccctctc cccgaccctg ctctctttca 4980 tttcagatga aatcgttagc accttaggagaggggacctt cggccgagtt gtacaatgtg 5040 ttgaccatcg caggtaactg tcagtccctccctactatgt ggggctaaag agatggttgg 5100 ggttatatgg ggcttttttg ctaattaacctgaggtagaa tttcttagtc cccctacagc 5160 cctgttcatt ttgagacatt cttgagaacccagcaaaagc ctctcctgcc aacttacagg 5220 ggtggggctc gagttgccct gaagatcattaagaatgtgg agaagtacaa ggaagcagct 5280 cgacttgaga tcaacgtgct agagaaaatcaatgagaaag accctgacaa caagaagtaa 5340 gcaagcaagg aagtgtgtag ggaggctgagagccccaacc cctacacggg agagattcct 5400 agacctggtt taggcagaca gggggagtcctagaccttct catccattca tcttttgttc 5460 atcatcttag agtagtagaa catcagggtaggaaaggggg ggggcccatg acattcctta 5520 atccactccc cttgttttca gggaagttagattgctctgt cgtttgaccc cagcctagaa 5580 tggcttctgt agagatatac cctggacatagccctaggac tgagcatcaa cctcatcgaa 5640 caatgaatta agcttttata agtagaactatgctgataag gccacagaca tttatgcagt 5700 aatattttgt tcacatacat tcacagtccaaaaatagaat aaggacttta agccaagata 5760 tgaggccagt aggtataaat aggagcccagtggtagctag tggtgaatga atatcaggag 5820 ttgggctggg gtttgggatt taggatggacagtttgatga ttccaggatc tatactgttt 5880 ggagcttggc actccacagc tcttccagcaaatagttttt gaactattta aaatgacgca 5940 ggaagatatt ttgaaacttg ggctgggcatggtggctcac gcctgtaatc ccagcacttt 6000 ggaggccgag gcgggtggat cacaatgtcaggagttcgag accagcctgg ccaatatggt 6060 gaaaccccgt ctgtactaaa aataaaaaaattagccgggc atggtggtgg acaactgtag 6120 tcccagctac ttggaaggct gaggcaggagaatcgcttga acaggaggca gaggttgcag 6180 tgagccaaga tcgcgccact gcactccagcctgggtgaca gagtgagacg ccatctcaaa 6240 aacaaaaaag aaaaaaacat ggcagatagtaccatttctt tcttgggtct ggtagaggct 6300 actccttagc tgaactgaat tttggtgttagtactagctg gcatggtttt acacaagtta 6360 tgtggaatca acagctatga agtacctccttgttggatgg ctgatgggca gatgggagct 6420 catcagacaa tcccccttcc cccatctctcttctcagcct ctgtgtccag atgtttgact 6480 ggtttgacta ccatggccac atgtgtatctcctttgagct tctgggcctt agcaccttcg 6540 atttcctcaa agacaacaac tacctgccctaccccatcca ccaagtgcgc cacatggcct 6600 tccagctgtg ccaggctgtc aagtgtgagtggggtgggcc gaagtggact ctggggcagt 6660 ccctcccttc attggatctc ttctgtcggttgtgcactgg tgaagcccct aaacagtcag 6720 ctgtctgtta tctgcagttc tttgatttactgtcatcttg aaacgtcttc tgacttaact 6780 ccttgactga tgtctttatc gtcactgattgctcttactc tacacctagc ctagcagcag 6840 ctagaagaga aagcctttgg aatcaaagcacttaattacc ctcccctttt cctttctccc 6900 tttcttggga aagcatcagt cagacagcaaacataaagag acaaaaatac actccttagg 6960 gtaaaggctt acatttgtct gggatgagatgttcattcac agcaaaggag atgggaacac 7020 agagtatgta gttcagctaa ggggcaaggtggggaattca aagaaatata tcattcctct 7080 ggagtcatca aaataataca gtttcacagaattgagttaa cataatgcca gctagacaca 7140 tgtcaaagtg tgtacacact acaactcaaagcaaactttt ttttttttta catcagttga 7200 gctacatatg tatcttacat tagaaagagcagagcttctt agaccaggca ttccatttag 7260 acgtagagcg gaaagcagcc ctagtgattctggacctgtc tcctcactgg ctttgcccta 7320 ggtaaccagg cctggggtca gctgataccagttagctctg gccacctgca ccaagcctga 7380 cctggccttc tcccctacag tcctccatgataacaagctg acacatacag acctcaagcc 7440 tgaaaatatt ctgtttgtga attcagactatgagctcacc tacaacctag agaaggtaag 7500 atggataggg tctgcccttg gttactgggggcaggcagct gcaccacttt gccttctgcc 7560 gagccctttg ttttctccct tttatttcgtcctcccacat tttctccctg actggctcca 7620 actgggtaaa actaaatagg ttgaaagggagaaatctctt aagaagactt aaattgggag 7680 ataacttgta caggggactt cagataactttccagtagag tgaaagtttt taaggttctt 7740 ggtttgggct ttatttattt atttatttatttatttattt atttatttga gacagagtct 7800 cactctgttg cccagactga tgtgcagtggcataatctcg gctcactgca acctctacct 7860 cccaggttta agtgattctc ctgcctcagcctctggagta gctggggtta caggcacccg 7920 ccaccacgcc tggctagttt ttgtgtttttagtagagatg gggttttgcc atgttggcca 7980 ggctggtctc gaacccctaa cctcaggtgatctgcctgtc tcgggctccc agaatgctgg 8040 gattacgggt gtgagccaca gtgcctggcccggacttctt ttttgagatg tgtatttctg 8100 tgaggtagga aagctcggct cctttgactaactggagata atgaagatct cagacctaaa 8160 gaagctctcc ctcctttgac ccctttgctacatgttacat atttttagag aaactccttt 8220 gctacatgtt acattttttc agaaaaacccatttgctaca tgttacattt tttcagaaaa 8280 acccatttgc tacatgttac atgttttcagaataactata tatctggcac ttgagtgtag 8340 tcttcagatg cttacagatg tgcacgctgttctataatca tttcttaatc cttttgctcc 8400 cctacttcta aaagtattat gctggatgtggtggaagctg taaaacagga atttcccagg 8460 cttgtaaggc caggcaggag atcacagtaccttttttaag gctcagaagc aatgtagaga 8520 atactggtgg aatctcagtc tgggtgcagaggttcatcct ctttctcctc tgctcctaga 8580 agcgagatga gcgcagtgtg aagagcacagctgtgcgggt ggtagacttt ggcagtgcca 8640 cctttgacca tgagcaccat agcaccattgtctccactcg ccattaccga gcaccagaag 8700 tcatccttgg taagggaggg caaggctgtccaagtgtgtg agatgatgtg agggtggggc 8760 cacctaagcc tcataacacc ttttccctcccattctcacc cagagttggg ctggtcacag 8820 ccttgtgatg tgtggagtat aggctgcatcatctttgaat actatgtggg attcaccctc 8880 ttccaggtaa gtgatgggat gtcttacttgactgcctggc attcttctac ctggttcctt 8940 tgttttctgc tgaggacctg cctactcagttctccacatt gcctgccttc ctggcagctg 9000 atccttagca tgcccttttc agtcccatgctcagtttctg tttttgtttc ccagacccat 9060 gacaacagag agcatctagc catgatggaaaggatcttgg gtcctatccc ttcccggatg 9120 atccgaaaga caaggtgaac cttgagggggcactagttaa ctcttttcct tttctctcca 9180 cagaattggt ctatttcaca tcattttcttttttctttga tacctcctct ccccccagtt 9240 actttcagat ggggaaataa gggaattgtaacaagggtga ccttctgatt cctcaacctc 9300 cccttcccct ctagaaagca gaaatatttttaccggggtc gcctggattg ggatgagaac 9360 acatcagctg ggcgctatgt tcgtgagaactgcaaaccgc tgcgggtgag ctgggctcgg 9420 gataaatagt gcccaccgtc cagaagtcacttccttctta gggtggttgc cccctggaat 9480 gctcttcaac aagccagagg gttaggaaaggaggggagga aagctgaaag aagacatctt 9540 tggtcaacag aggaaacata agagggagtggttttgcgga gggaaggagg ttagacagcc 9600 taaccttgag acaaccagag atcaaagcaatgtcctggat tctttaggtc agacagaaaa 9660 gaataaacta cccttgaaga gcttacattttaatgaggaa ctaaagaaga ttcatgaagt 9720 tgacaaggat atacaagtag aaagaactttcaaagattat ggagtaattg tgctagaggg 9780 aaggtaggtt gagctataat atcagaaacgttggtcctgg tgtggtcgtg gttaggtagc 9840 cttcaaattg gttgcaagca gagccttggttctccaagaa tgaaaggtag ggtcttgaag 9900 aaggcagggt ttgtaaggca tcctgcctcacctttttcct gccctcctcc accagcggta 9960 tctgacctca gaggcagagg aacaccaccagctcttcgat ctgattgaaa gcatgctaga 10020 gtatgaacca gctaagcggc tgaccttgggtgaagccctt cagcatcctt tcttcgcccg 10080 ccttcgggct gagccgccca acaagttgtgggactccagt cgggatatca gtcggtgacg 10140 atcaggccct gggcccccct gcatcttttatagcagtggg tgtccagtcc aggacactgg 10200 tgctttttta tacaagagaa cgagccagagttcactcctt cctcctggct ctctatatac 10260 ctgtgaatat gtgaaatagt gtaaatatgaaagaacttgt acctatcact tcaacccctg 10320 ccttgtacat aatactattc catccacacagtttccaccc tcacctgccc cctcatacgg 10380 agttggatgg gggccgagtg aggtaaccaggtggcatcta ccccatgttt tataaggaat 10440 tttgtacagt ctttgtgaaa taaaataacgtgcttcattt gacccccatc cctggagttg 10500 gaggtttggg aatgctgggg tggagggatgaaactattgg caaactttct gagtttgggt 10560 atgaagggag tcctccttac cctccaaaatgaagcacagc caggctacca tttatttccc 10620 ctgtccacct tatcatatgg gagggtagtgatgggtgggg cagcatttct ttcagattaa 10680 aacagagaag tgttatgagg tggcacttctcggatgtgga attatgagag ttgggaagat 10740 ctgactccta gagtcattag gccgcggcccagtatagagc cagaaactca ggttgaaaac 10800 gtgctcaacc tggctcctag gggggattgccaggaccggt cagaaggctc ccgtcgtcca 10860 tctcgggaga ctaggaaggc cggattcctacgcgaggcct gctgggaagt gtagttcgtt 10920 agtggaagga agtcacatgg aagaggggcggtagttggtt gtgggcactg ggttagaggt 10980 atcacgtggg ggcactttcg tcttagcttttggacaagac gcaggcgcaa acccacggct 11040 gctgcggggg atccttgtgg ccctttccggtcggtggaac caatccgtgc aacagagaag 11100 cggggcgaac tgaggcgagt gaagtggactctgagggcta ccgctaccgc cactgctgcg 11160 gcaggggcgt ggagggcaga gggccgcggaggccgcagtt gcaaacatgg ctcagagcag 11220 agacggcgga aacccgttcg ccgagcccagcgagcttgac aacccctttc aggtgacttg 11280 cgccagtcgg cctcttttgg gcggtcaggttgattcttcc cggttctgta gggtcgggct 11340 aacttgtatc cccatttgtg acatttgatcctgggaagag ccgccacgtg gggtgacagt 11400 gactccagac cagtgagcac tctggggggcgggccctgcc ccttaatggg ctggtgctgg 11460 cagtgttgga tgatggattt gaggtaaatgttgtcccagt cctgggacaa cggcgtggcc 11520 ccgaaggtgc agtacttgga gccacaggcagtttgggaat ggtccctggg aattccccta 11580 ggtggcactg ggtgccagct gagacccgggtctctgccct caggacccag ctgtgatcca 11640 gcaccgaccc agccggcagt atgccacgcgtgacgtctac aacccttttg agacccggga 11700 ggtgagctac tggagagcat aggagttcaaggaagggaag ggtttgccag tgagacaagt 11760 tagcttgggt acaggggact tttctgcatgcacaggaagg aagaggcact gtgatcttag 11820 ttccctgaga gaggctagtc agccatgggatgcctcatcc ttctagggcc accagggcca 11880 ccaacccaaa tggtttgagc tctaaaagtcagtcctggga agatctggag gctggcaagg 11940 ccttaaccaa tgttctgtat gggaaggctttcttgaagaa aataaaggtt ggggctggtt 12000 ggagtttagg ttaggtgtgc agtttgaaagcaaggaatga ggaggactac acctgtttgt 12060 cccactggat tagaagatgg tgagttgtaagaatttaagg ggttttcatg gtacttaaaa 12120 cttaaaaaaa aaagccaagc atggtgtgcatgcctgtaat cccacttggg aggctgaggt 12180 gggaggattg cttgagccca ggaatttgatggagaccatc ctgggcaaca gcgagacccc 12240 catctcttga gacgttttaa aaattggctaggtgtagtgg cacatgcctg tagtccttgc 12300 tacttgggag gctgggatgg aaagactgcttgggcccagg agtttgaggc ttcgatgagt 12360 gatgattgca ccactgcact ccagcctgggtggcagagag acaaccccac actccatccc 12420 cagaagaaaa gagaaaaaaa aaacacagttaagtggggag atatggtaaa tgagtgattt 12480 gagtcctcac taaggaattg gatatgagggatgataaggc tgctacaggt catcatagag 12540 ttctctgaca gaaattcaac agaagaactagcttcctaac aaatggggga aaggctgctt 12600 ccttacatca tgagttctgg gtctaaaggtattcaagcag actatagatt cctgtgttat 12660 gagggactag atgatcacta aggtcttttcctcaacagcc accaccagcc tatgagcctc 12720 cagcccctgc cccattgcct ccaccctcagctccctcctt gcagccctcg agaaagctca 12780 gccccacaga acctaagaac tatggctcatacagcactca ggtacaggag gtgtgcaggt 12840 gagggctggg gagaagggcc agtcctggggcccagggctc acatctccgt ctgctcacac 12900 tgggcaggcc tcagctgcag cagccacagctgagctgctg aagaaacagg aggagctcaa 12960 ccggaaggca gaggagttgg accgaagggagcgagagctg cagcatgctg ccctgggggg 13020 cacagctagt aagtaataga gtggggaagagcatctgaga gtttgggagg agcaaaaaca 13080 ggtcttaagg gcctgggctc ggctgggtgcggtggctcac acctataatc ccagcacttt 13140 gggaggctga ggtgggcgga tcacgaggtcacgaggtcaa gaaatcgaga ccatcctagc 13200 caacatggtg aaaccctgtg tctactaaaaatacacacac acaaaaatta gctgggcatg 13260 gtggcgcgca cctgtagtcc cagctactcgggaggctgag gcagggagaa tcgcttgaac 13320 ccgggaggtg gaggttgcag tgatccgaaatacgcgccac tgcactccag cctggtgaca 13380 gaaatgagac tccatctcaa aaaaaaaaaaacacaaacct gggctctgga gtcagactgc 13440 tgagtttgaa ttttagagat actgcttactaaccatgagc ttttagggaa gttacttaat 13500 cttgaatgcc tcagtttccc tatttataaaatgaaaacca cgtttgcatc cgtttgacag 13560 ggttgttgtg aaggttaaac aaaatatatgtgaagtactt ggtacagtgc ttagatgttt 13620 taaataataa taaacagtat taatttttccactctcactt gtctccttgg accccaaatt 13680 ggagataaaa gagaggatcc agggctgggtatggtgcctc acgcctgtaa ttccaacact 13740 ttgggaggcc gaggcaggca ggttgcttgagttcaggagt tagagaccaa cctggacaac 13800 atagtgagat cctgcctcca aaacattaatgaaaaaaatt agcagggcat gatgtacctg 13860 tctgtagtcc tagctcctca agaagctgaggtggacggcc aggcgccggg gctcatgcct 13920 gtaatcccag cactctggga ggccaaggcgggtggatcac ctgaggtcag gagttcagga 13980 ccagccaaca tggtgaaacc ctgtctctactaaaaataca aaaattagct gggcgtggtg 14040 gcatgtgccc ataatcccag ctactcaggagactgaggca ggagaattac ttgaacccgg 14100 gaggcggaga ttgcactgag ccaagatcacaccactgcac tccaacctgg gcaacaagag 14160 cgaaactcta tctcaggaga aaaaaaaaaaaaaaaaaaag ctgaggtggg agggttgctt 14220 gagcccagga ggttgaggct acagtgaaccatgatcatac cactaccttc cagcctgaac 14280 aacagagacc ctatctcaaa aaaaaaaaaaaaaaaaaaag agaggatcca gggatggaga 14340 gaagggggag tgattccttt gttggtctctgttttacttc tggggtccac ctgttttttg 14400 gcgccttaca gctcgacaga acaattggccccctctacct tctttttgtc cagttcagcc 14460 ctgctttttc caggacatct ccatggagatcccccaagaa tttcagaaga ctgtatccac 14520 catgtactac ctctggatgt gtgagtagtgagaagccttt tggaggaagt tacaggtaga 14580 tctcttaact gccctggggt ccgctcaccaagaaccaaac acttcacctc tatttagaac 14640 tcaccagcct gctagcaaat gttcttgccctctccccact ttttattgtc catatgcagg 14700 aatatatttt gaattcttta gatgtctttgggctgggtgc agtggtatac gcctgtaatc 14760 ccagcacttt gggaagctga ggtgggtggataacctgagg tcagaagttt gagactagcc 14820 tgatcaacat ggagaaaccc catctctactaaaaatacaa aattacctgg gcgtggtggc 14880 acatgcctgt aatcctagct actcaggaggctgaagcagg agaatcactt gaacccggga 14940 agtggaggtt gcaatgagcc aagatcatgccattgcactc cagcctgggc aacaagagca 15000 aaactccatc tcaaaaaaaa aaaaaaaaaaaaaaaaaaaa aaaaaaaggc cgggcacggt 15060 ggctcacacc tgtaatccca gcactttgggaggcccaggt gggcagaaca tgaggttagg 15120 agatcaacac catcctggct aacacggtgaaaccccgtct ctactacaaa tacaaaaaat 15180 tagccgggcg tggtggcggg tgcctgtagtcccagctact taggaggctg aggtaggaga 15240 atgggcgtga acccaggagg cagagcttgcagtgagccga gatcctgcca ctgcactcca 15300 gcctgggcga cagagcgaga ttccatctcaaaaaaaaaaa aagaaaaaaa aaaattagct 15360 gggcgtggtg gcgggcgcct gtagtcccagctactctgga ggctgaggca ggagaatggc 15420 gtgaaccggg gaggcggagc ttgcagtaagctgagattgc gccactgcac tccagcctgc 15480 gcgacagagc cagactccgt ctcaaaaaaaaaaatgtctt tgcaagggga tcacacatta 15540 ttgacatttg ctttctccat ctcccctgttgaggattcca tgaaggcagc agctgccttc 15600 atttccttac tctgccatgt ttggtgaatattataggatg agcatagatg ggaaggagcc 15660 ttcattgcag tccagaaggg ctcctcatcctgtccctctg ccccttaggc agcacgctgg 15720 ctcttctcct gaacttcctc gcctgcctggccagcttctg tgtggaaacc aacaatggcg 15780 caggctttgg gctttctatc ctctgggtcctccttttcac tccctgctcc tttgtctgct 15840 ggtaccgccc catgtataag gctttccggtaagtgtgtta gtggtgggag agtgatggag 15900 acctgggatg ggccccacgt ctgcccatccttcagctcta attcttctcc caccctcccc 15960 atttttttcc tctttgtagg agtgacagttcattcaattt cttcgctttc ttcttcaatt 16020 tcttcgacca ggatgtgctc tttgtcctccaggccattgg tatcccaggt tggggattca 16080 ggtttgtgag gctgttatcc accctcacctttccctctag atccagccag cactgggtgc 16140 tggataggag ttgttcagaa aaaggaaatgtggttttaat ccttgggagg tactagttta 16200 atgagataca agacatactt ccaggatagagcgcagacag cactcttgac acatatagat 16260 tgaagggaag aatgctgcat ttggccaatctaaggtggct tcctggagga ggcatagaac 16320 catggttgaa aggaggaaga agaatccccagatgagtgcc tggaagagct tggagagctc 16380 agggctaatg gttcagaaac tggaattaaactatgaagag attagataca gtttgggatt 16440 gtggtgcttg gaatgctgct cgattgacagggagccactg ctgatgggaa gggtaggaac 16500 aggggatgct ggtgacataa ccagtggagaagctgaggag cccctcttca ctggtacatc 16560 cttcccttta cagtggctgg atctctgctctggtggtgcc gaagggcaac acagcagtat 16620 ccgtgctcat gctgctggtc gccctgctcttcactggcat tgctgtgcta ggaattgtca 16680 tgctgaaacg ggtgagggct gtgtcgaaggtggggccggg atggtgagat catgggtccc 16740 caggggcgtg ggtggaacat tcaggagcaactggcacagg tcaggctgct gggttgttct 16800 cagctaatgg acctctgggg tgtgtgtttctgtgtgtgag tgtgtgtgct gggcagcagg 16860 ctgctgagtg gtagtgatgc tgttaggctgggggtgggga accagtggct ggaatgggcg 16920 gtaatgtctt tgtcctctac ttgcagatccactccttata ccgccgcaca ggtgccagct 16980 ttcagaaggc ccagcaagaa tttgctgctggtgtcttctc caaccctgcg gtgcgaaccc 17040 gagctgccaa tgcagccgct ggggctgctgaaaatgcctt ccgggccccg tgacccctga 17100 ctgggatgcc ctggccctgc tacttgagggagctgactta gctcccgtcc ctaaggtctc 17160 tgggacttgg agagacatca ctaactgatggctcctccgt agtgctccca atcctatggc 17220 catgactgct gaacctgaca ggcgtgtggggagttcactg tgacctagtc cccccatcag 17280 gccacactgc tgccacctct cacacgccccaacccagctt ccctctgctg tgccacggct 17340 gttgcttcgg ttatttaaat aaaaagaaagaggaactgga actgacatcc ccgtttcctg 17400 aatcttcatt gggaattagg ccttatgaaagagaaaggag agtgtggagt gaggtgggaa 17460 ggtcggaccc ggctttagtg taaactgggagagagatgag gggcggggcg gtggagcccg 17520 agtggctggc gcaggaagga ggtgggaagtccacggaaac gcgaaacccg gagacgccag 17580 ggagccctgc tccccacccc tctccattaatgacggggaa gagccaccgc ctctgccggg 17640 aacgccaagg aatacgcggg cctggagcctgaaaagctgg atggggctca agtggaggcc 17700 caaaggatca ctagcagcct agccagggtccagagcgagg cagggactgg aggagcgctt 17760 atccgactgc ctcgccctgc cgcgggatcccccaaccccg acagggtctc agtcccgaac 17820 tacaactccc ggggtgcacc gcgccggccctcgccgccat gcccctcctt cccgcaccat 17880 ccccattccc atcccccttc tctagtccccgacctgcggc agccggagct cggggagcgg 17940 agcgtggtgg ggaggggagc gggacaggcgacacaggaga cagcggcgcc gcggcctctc 18000 cccaccaggc ggccccggat cctactggacgccctgaggg cacaccgacc gcgcctctag 18060 agtcacccca cgccgacccc tcccctcttctctagactta tttccatcct tcccgctttt 18120 accctcccca cccgtccctg ggctccaggccgccgccccc tcctcactcc tggaccggcc 18180 cttctcggtg cccctcttcc ctagggagatgcgatgagcc ggtgcccccg cgtcctcatc 18240 gtcgccccgg gcacggtgcc cgtccagtgcccgtggtggg gagggagcac tccgcggtcc 18300 ctccgtgacg cccctcgctt ggccccccccacagctggcg tccctcggcc atgccccagg 18360 ggacccagcc agggggtggg ctctagagcgagtggggtgg agaggagaaa ggacggggcc 18420 ttgggggcct ctgagatgct cccaagtgccagggagggcc gagcgaggcg caggcaaccg 18480 ggcagcaggc atgatgccct cgcctagtgactccagccgc tcgctgacca gccggcccag 18540 caccaggggc cttacccacc tccgcctccaccgaccctgg ctgcaggccc tgcttacgct 18600 ggggctggtc caagtgctcc tgggcatcctggtggtcacc ttcagcatgg tggcctcttc 18660 cgtcaccacc accgagagca tcaagaggtcctgcccgtct tgggctgggt tctcggtgag 18720 ttgggtgcac agttgttggg tgggggaggctcctcgggcc ccaccctcca caccagctgc 18780 aagtccacat gcttgcctct cctccctctcctggtcctct gcctccttac agggctgggt 18840 gcatcctgtg gtgaggggct cactcaggagcctccccctg ccaggctgag cctgtgccca 18900 ctctgtcccc agctggcgtt ctccggggtggttggcattg tgtcctggaa gcggccattc 18960 actctagtgg taggtgccag ggtccagtgcccactgggag gcaggtgccc agcacgcaag 19020 gggaagccca tttatgacct caagaagggaactggctccc cagttggtgc cagccgggtg 19080 ggcacgaagt tctgtgaagg agggggactctgtccgtggc agaggagtac tcatgagggt 19140 cccagccctg agtctgcacc ctgttttccccagatctcct tcttctcctt gctttcggtg 19200 ctctgtgtca tgcttagcat ggctggctctgttctctcct gtaagaatgc tcaactggcc 19260 cgagacttcc aacagtgctc tctggtgagatttgaggagg gagagctgga aagaactggc 19320 tgggggaggt gtgcaggaca cctcagtttgtgctgactca ggctgcctca ccctccctgc 19380 tccactcagg aaggaaaggt ctgtgtgtgctgtccctctg ttcccctcct ccggccctgt 19440 ccagagtcgg ggcaggaact gaaagttgcccctaactcca cctgtgatga agcccgaggg 19500 gccctcaagg tgagcttgca ccctgcaaacatcctcctgg ttctcactct tgcctcccct 19560 gctggggatc tcacaggccc ataatgtgtggaacttagcc gtctcctgac tcctctcctc 19620 ctttcccttc cccagaacct gctcttcagcgtctgtgggc tcaccatttg tgccgctata 19680 atctgtacac tctctgctat tgtctgctgcatccaaatct tctccctgga cctcgtgcat 19740 acggtgagaa gggagcaggg gccagggcacgcaggtatgg tgggggcagg ggtgtgtgtg 19800 ctgagacttg cctgagggaa caatggctacagatctggct tttgagcacc aggggctatg 19860 ggttatctat tatcctcatc tattggaggaatggatgttt agtggggagg atgagaaggg 19920 gagatggcag ggagtgagtt aagtatgtgatgctggtctg ggaccaggag agggtgcttt 19980 agagaatggg actggatggt ggggtagagtcaagaaggtc ctatgctggg cacggtggct 20040 cattcctgta atcccagcac tttgagaggcccaggcaggc ggatcacctg aggccaggag 20100 ttcaagaccc acctggccaa catggcgaaaccctgtctct attaaaaata caaaaatcgg 20160 gccgggcgca gtggctcgtg cctataatcccagcactttg ggaggctgag gagggcagat 20220 cacctgagat caggagatcg agactatcctggctaacatg gtgaaacccc atctctacta 20280 aacatacaaa aaattagcca ggcgtggtggcgggcgcctg tggtcccagc tactcgggag 20340 gctgaggcag gagaatggca tgaacctgggaggcagagct tgcagtgagc tgagatagtg 20400 ccactgcact ccagcctggg cgacagagcaagactccatc tcaaaaaaca aacaaacgaa 20460 atacaaaaat tagccaagtg tggtggcgggtgcctgtaat cccagctact tgggagtctg 20520 aggcaggaaa attgcttgaa ccggggagtcagaggctgca gtgagctgag atggtgccac 20580 tgcattccag cctgggcgac aagagcacagactccgtctc aaaaaaaaaa aaaaaaaaaa 20640 aaaaaaaata tatatatata tatatatatatatatatata cacacaaaat tacaaaaatt 20700 aggctgggct ccgtggctca tgcctgtaatcccagcactt tgggaggcca aggcaggagg 20760 atcaccagat atcaggagtt tgacaccagcctggccaaca tggcgaaacc ctatctctac 20820 taaaaataca aaaattatcc gggtgtggtggcgggtgcct gtaatcccag ctactcggaa 20880 gactgaggct ggagaatcgc ttgaacctgggaggcagagg ttgcagtgag ctgagatgta 20940 gccattgtac tccagcctgg gcgacaagagtgaaacttcg tctcgaaata ataataataa 21000 taataattag ctgggcatgg ttgcacacccttataattcc tgctactcag gaggatgagg 21060 catgggaatt gcttgatctt gggaggtgggggttgcagtg agctgagatc gcgccactgc 21120 actccagcaa cagagtcaga ctctgtctcaaaaaaaaaaa gaaggtccta gatggaggtg 21180 aggctaaagg gtggacattc ctgtgaagacacagaatgtg tggagcttct ggatggaggt 21240 ggggccataa agaggaattt gtgggtgggcagggccagag gctagaggta cagggtgacg 21300 gtggggttaa ggagagctga ttttgggtgagggaggggcc agaataagag cttctctcag 21360 gggatagggc cagaaaaaat aatgtagatgagagaggagt ttggggccca ggaggagctg 21420 tattcccaga atccagactt gctgacctgctcgcttcccc agcagctggc ccctgagcgg 21480 tcagtctcag gcccactggg acctctgggctgcacgtccc cgcccccagc ccctctccta 21540 cacaccatgc tggacctgga ggaatttgtcccgcctgtgc ccccaccgcc ctactatccc 21600 ccagagtata cctgcagctc agaaacagatgcacagaggt aaggcctggt ggggtcttgc 21660 tgggggagct aaggaagggg actggggctgggcctggatt gctggagaga gggatctttg 21720 tggaggggga attatttctc ctacattgaccctcttcctc atctgccagc atcacgtaca 21780 atggctccat ggacagccca gtgcccttgtaccctaccga ttgcccccct tcttatgagg 21840 cagtcatggg actacgagga gacagccaggtgagagcaca gcacggcttg gggcgggctg 21900 gggagccggg ttgtagcctg gaaagctgaacaggctgtag cctctggata aagatagtaa 21960 cagtggtcaa ggtctttaca gcaccagcatgcccactgtt gcctttgatc ttccctagag 22020 cctggtgagg tgagtgggtt ggaagctatcatttctgcca tacagacgtg gaagctgagg 22080 ctgcacaatt aagtgacttg tacaaagggacaaggctggt ctggaatcta agtctccaga 22140 gctctgtcta gcacatgggg gtgttcagtgtgtagggctg aacccttgac cctgtgtctt 22200 ctgcaggcca ctctctttga ccctcagcttcacgatggct cgtgcatctg tgaacgagtg 22260 gcctccattg tagacggtga gcagggcgtaatgaggggtg gacaagggcg gggctgccag 22320 ggatagctgg ggtgggtaga gacaatagaaggggaaaaca aggcggagtt ggtggtttgg 22380 ggacatagga gggctgtggc aacttggaaccctggactta tttttctcct ctgagataaa 22440 gctggaggca cagtggcccc tagaggctgggcttgggaga agaggaactg cctgggcagg 22500 gctaggccag gccaggctgg tgctacacagcgccccctgc cgcccacagt gtccatggac 22560 agcgggtctc tggtgctgtc agccattggtgacctccctg ggggctctag cccgtcggag 22620 gactcgtgcc tgctggagct gcagggctccgtgcgctccg tggactacgt tctctttcgc 22680 tccatccagc gcagccgtgc cggctactgcctcagcctgg actgtggcct gcggggcccc 22740 ttcgaggaaa gccccctgcc acggcgccccccacgggctg cccgctccta ttcctgctct 22800 gcccctgaag ctccaccccc actgggtgcccccacagctg cccgcagctg ccaccggttg 22860 gagggctggc cgccctgggt gggaccctgcttccccgagc tgaggcggcg ggtcccccgg 22920 ggagggggcc gcccagccgc agccccgcccacccgagccc cgactcgtcg cttcagcgat 22980 agctcaggtt ccctcacccc accggggcaccggcctcctc atccggcatc cccaccaccg 23040 ctgctgctgc cacggtccca cagcgacccaggcatcacga cctccagtga cactggtgag 23100 ccccctcccc gactgcccag gctcaggagagggtaggcac tgggagttag gtggccagtg 23160 atgcccacca ggattggggc acagttgaggtgggtgagga ggaagaaagg gtgagttcac 23220 tcctctggta gacattctca ggtactcacctcctagggac agaacatcca tagcttagca 23280 gctaaaaaag gagaattttt ttttttttttgagacggagt ctctggctct gtcgcccagg 23340 ctggagtgca gtggcatgat cttggctcactgcaacccgt ttcccgggtt caagtgactc 23400 tctcctgctt cagcctgccg agtagctgggactacaggtg tgcgccacca tgaccggcta 23460 atattttttt tttttttttt ttttgagacggtgtctcgct ctgtcaccca ggctggagtg 23520 cagccgcgcg atctgggctc actgcaactccgcctcccgg gttcacgcca ttctcccccg 23580 ggttcacgcc attctccttc ctcagcctcccgagtagtga gtagctggga ctacagatgc 23640 ctgccaccac gcccggctat tttttgtatttttagtagag atggggtttc accgtgttat 23700 ccaggatggt ctcaatctcc tgacctcgtgatctgcccgc ctcggcctcc caaagtgctg 23760 ggattacagg catgagtcac cgtgcccgggcaatttttgt attttttagt agagacaggt 23820 ttcacccttt tggccaggct ggtcttgaactcctgacctc aagtgatcca cccgcctcgg 23880 cctcccaaaa ggatttattt tttgaaaccagttccacagc tctcagcttg gtccacttat 23940 ctgtcctccc caagcttcag ctgtcacttgttaacatgta taataatagt acttcacgcc 24000 gggcacggtg gcttgcacct gtaatcccagcactgtggga agctgaggtg ggtggatcac 24060 ctaggtcggg agttcgagac cagcctggctaacatggtga aaccctatct ctactaaaaa 24120 tacaaaaatt agccaggtgt ggtggagcgcgcctgtaatt ccagctacta acgagagagg 24180 ctaaggcagg agaatcgctt gaacctggaaagcaggggtt gccgtgagcc aagatcatgc 24240 cactgcactc cagcctgggt gacagagacacactccatct caaaaacaca aacaaacaaa 24300 caaaaaacat gtataataac agtacttcagccataggcat tgtactcgaa gatgctgaga 24360 aagaacagtg gcgggcaagg ctgttcacacctataatccc agcactttgg aggccaaggc 24420 aggtggatca cctgaggtca ggagttcaagaccagcctag ccaacatggt gaaaccccca 24480 tctctactaa aaattgaaaa attagctgggcctggtggtg gacgcctgta atcccagcta 24540 ctagggaggc tgaggcagga gaatcgcttgaacctgggag gcggaggttg cagtgagctg 24600 agatcgtacc actgcactcc ggcctgggcaacacagtgag actccatctc aaaaaaataa 24660 gaaaggagat agtactgggg aacgctcagcactgtgcgcc aggtgctgaa caacaccact 24720 gcagtccttg ttgtggtgga ttgtaccatctagttgctgg ctaatatgga cagagatgct 24780 ggccctttga ttggggatgg agcgtgggagctgtgaaagc tcctctgggc ttgagttccc 24840 acaggagggt gggcgtgtcc acagaacacttccactcact ccctgtctcc ctttctctct 24900 tctccccagc tgacttcagg gacctttataccaaagtgct tgaggaagaa gctgcttctg 24960 tttcctctgc agatacaggt caggcatgtggtttgcgccc cagggatggg gattgggcat 25020 ggctgcccag ccccctctcc accctacaataccattctct tatctctgtc tctctgcagg 25080 gctctgctct gaagcctgcc tcttccgcctagcccgctgc ccttccccca agttgctacg 25140 tgcccggtca gccgagaaac ggcgccctgtgcccaccttc caaaaagttc ccctgccctc 25200 gggccctgca cctgcccact ccctgggggacctaaagggc agctggccag gtcggggcct 25260 ggtcactcgt ttcctccaga tatccaggaaagccccagac cccagtggga ctggagctca 25320 tggacataag caggtaggaa ttcggggagccaggaaagat gtttgggaaa gcgtggagct 25380 tcagattgag ccttattgat gatgccctttcttgtgtccc tgtccaggtg ccccggagcc 25440 tgtggggccg gcctggccga gagagcctccaccttcgcag ctgcggagat ctgagctcta 25500 gctcttccct gcggcgtctc ctgtctggccgcaggctgga gcgtggtacc cgcccccaca 25560 gcctcagcct caacgggggc agccgggagactgggctctg acctaggctt cttgtcacac 25620 tgaacacatc cagccacagg caccagctggttgggaccag cagcccccag catcctcttg 25680 cactggctgg cacaaaaaga aacctgctgtatacccccca aagtgtccct ttccctccta 25740 cctctggggt ctcttgctgc ttgcctctgctgctctggtc tgggagagct tctgtcctgt 25800 gctgcatggg tatttagact gtgggggagatgccccttct tatagcactg gaggaggaaa 25860 acaaattctt gtccccctca gaatgagagtggctctttct gatttgcaag ggcactatgg 25920 tcagggcaaa ggcatggccc aggtgtttaagtacagggtg acgtgtgcct atgcaatggg 25980 gtggtaaggc aggcacgaag agtccaaaaaatctaggtgg cctctcagct ctgccacctc 26040 tagctgcatg accttgggca agctatgtaaccccaattgc ctgctccatt aaagactgtg 26100 aaggtagaat gtttgtaaag ctcttaacagtatgtaagcc ttcaataaat ttcagttttc 26160 cccttgtttt cttgatcatt ctctgtcaccagtgaaattt gttctagtgt ctctcatatt 26220 taagaaaact ctttcaggac tgggtatggtggctcacacc tataatccta gcactttggg 26280 aggccgaagc aagaggatcg cctgagcctaggaattcaag accagcctgg gcaacatagt 26340 gagaccctgt ctctacaaaa aacaaaaaattagccaggca tggtgggaca cgcctgtagt 26400 cccaactact caggtggcta aggtgagaggatcacttgag cttgggaagt ccaggctgca 26460 gtaagctgtg attgagccac tgcactacagcctgggcaac agagcaagac catgtctcaa 26520 aaaaaaaaaa aaaagaaaaa agaaactttcaagacactct ttccaaccac taattgtaac 26580 tctgctcctc cttttcacag caataggttttctttttctt ccctccactg ttaaacatcc 26640 attctctcct cacccacccc catcagactccttcccctat ctttccacag ccactgctct 26700 gaccaaactt tccagtgacc acagtggtgtcagacccagt gaccatttct ctgcctgcat 26760 ctcacttgac ctcgaggcag caattaatacccataatcag catcttcttg aatttgtccc 26820 tttgaaaagg gaaatattgg ctcttctactttgtcctgct gaactgctta acattggagg 26880 gccccagggc cctcacctaa gccctctttcctacctccac tctttctata ggtggcccta 26940 ctactaaagt ccatggcttt aaataccatctttctatgtg ttaatccatg actccagcct 27000 tgacctccca tgagcgccat ccaactcagcatgtctgctt ggatgtctaa tgggcatttc 27060 agattcaaca tggccacaac tgaactcttgattcccaccc cagcaccggt tatttttcca 27120 ctgttcccat ctcaatggca cctccattacccatttgcac attccaaaag ctcaggaacc 27180 atggtgactt cttttcccat atccaacacaaccaatccta tcctgaattc atccacatcc 27240 caccacctcc ccagctacct agctccagccatcctctctc cacaacctct gaatcagtct 27300 ttcacttttc ccagcaatcc attctccactcagcaaaatg atgataaagc acgtcacatc 27360 aaggctctgc ctcaatttaa tggcttcccattgtatttag aatcatctcc aaactcccag 27420 agactatggt cgagctacaa tctggcccaccttctgttcc agccaaattt cctcacagca 27480 caaggacgtt tgcacctgct gttttgccaagcatgaaacc cttggcccct atatctggtg 27540 ctatcaccta atatcaggtt ttagctccattctcaccatt tcagtgagca cccaatcccc 27600 atcgcagtca ttctatcaca tagccatgtttttttttgtt tgtttgtttc attttgtctt 27660 tttttgagac agggtcttgc tttgttacccaggctggagt gcagtggtgt gatttgggct 27720 cactgcaacc ttccacctcc tgggtcaagcaattctcctg cctcagcctc ccgagtagct 27780 gggattacag gcgcccgtcc ccatgcccgcccagctaaat tttgtatttt tagaagagat 27840 agggtttcac catgttggcc aggcgggtctcaaactcctg acctcaagta atccgcctgc 27900 ctcggtctcc caaagtgctg ggattacaggtgtgactcac cgcgcctggc cacataccca 27960 tggtttcagc atgtatcact atctaaaattattatttttg tttatatatc tgtgtcgtcc 28020 catagaaggt taaggtccca agatcagaaacttgctcatt gcagtgggtc taacactcag 28080 taggtcctca acaaacattc gttaagatactaaagtggca gggtggggcc ctgtaaacag 28140 cttcaggagc ttcaggaccc tgtgcttgtaggggcaacgt ggtgccctcc aaggaagaca 28200 gggaggtggg aggagcactg cccagagatggcgtcaggct gcaagacttc ttgaataatt 28260 cagcatcata acaacccagc ctcaggaagggatagggcac ggccaggacg aaacattagg 28320 aggcgatgga caatgggatt cccacggggcagcttctgcg cactggacgt tccctaacct 28380 gaggctctct aaagaggaag gttaggaatcctctgagctt cggtgggctg gactcactgt 28440 gggaattcaa tcgcccccat ccaccaacagtgtgctggcg ggaaaacgcc gacacgcatg 28500 cgtagttctc gcgccggctc ctctctctctctctctctct ctcgctcgct ctctcgctct 28560 ctcgctctct ctcgctcgct ctctcgctctcgctctctct ctctctccgg ctcgccagcg 28620 acacttgttc gttcaacttg accaatgagacttgaggaag ggctctgagt cccgcctctg 28680 catgagtgac cgtctctttt ccaatccaggtccgcccgac tccccagggc tgcttttctc 28740 gcggctgcgg tgatcggtcg ggctgcatcctgccttcaga gtcttactgc gcggggcccc 28800 agtctccagt cccgcccagg cgcctttgcaggctgcggtg ggatttcgtt ttgcctccgg 28860 ttggggctgc tgtttctctt cgccgacggtaggcgtaatg aatatttcga cctttggatc 28920 ttagctgtcc cctccctgcg ttcgcacttaacctttttca ccattattat tattattgtt 28980 attattatta ttttttgagg gagtctcgccctgtcgccca ggctggagtg taatggcgcc 29040 ttcttggctc actgcaacct ccgcctcccgggttcaggcg attctccgac ctcagcctcc 29100 caagtacgtg ggattacagg cacccgccaccacgcacggc taattttttg tatcttttag 29160 tagagacggg gtttcaccat gttggtcaggctggtctcca attcctgacc tcgtgatccg 29220 cccgcctcgg cctgccaaac agctgtgattataggcgtga gccaccgcgc ccggccaacc 29280 atcattatta tttttaacgg taaggatggtcagattttac taatgaagaa gagattataa 29340 aatcttcaag tctttatatc cacttgctttttgaggggtg gagtgggaag aaggttatgt 29400 aattcatacg ttcttcagag atgtgacaaacattcacgga gccggacgac gtcgggttgg 29460 attcgcactg gagctgcaga tgggtgccaggatggactgg tccctaccct ccgcttgaac 29520 ctaggaggcg gaggttgcag tgaaccgagatcgtgccact gcactccagc ctgggtgaca 29580 gagatactcc gtctcaaaaa aaaaaacaaaacaaaaaaca agcggactgg gcgcagtgcc 29640 tcaccctgta atcccagcac tttgcaaagccaaggcggga ggatcctttg agtttaggag 29700 tttgagacca acctgcgcaa cacagtaagaccccgtctct acaaaaaata cagaaattag 29760 ccaggtgtgg tggtgtgcgc ctatagtcccagctattctg gaggctgagg tgggaggatt 29820 gcttattctg gaggcagagg ttgcactgagccgaaatcaa gctactacac tccatccagg 29880 gcaacatacg gagaccctgt ctcaaacaaacaaacaaaaa attgctcagt acctggccaa 29940 aaaagaagag gctcactatg cagaggggaagtggaaggag atgtttggac ttctaaactc 30000 aatagagcag gagaggcaaa tgtagaatgtgctcaggaaa tatctgtgag atgaatgaac 30060 ttgagggaag taaggtacta gatattacctgccctaccca gaacaaatcc tgtgcaatgt 30120 ttccttgaaa agtgagaagt ctggaaggggtggctactga catagtgaag caactagttc 30180 aattctacaa cttgacagct accctgtgccaggctatcta cgaggatact tagaatgcat 30240 aagacattcc ttcaaggaac tccaggaacagaggcctgac atgttgcaat gtttagtgtc 30300 aagcagtgta ctagagacac attatcacactcaaacctca caacaattct gtgaggtagg 30360 agttatcact ccccttttat agatgaaacagaggcttaga gtgattgatt tattgaaagt 30420 caaacagcca gtaaatggtg tagccaggattccaaacttg ctgtctcact gagactgtac 30480 ttaattactg gagggaccgg gtgtggtggctcattgctat aatcccaaca ccttgggagg 30540 ctgaggctgg tggatcacct gaggtcaggggttcgagacc agcctggcca acatggtgaa 30600 accccatctc tactaaaaat acaaaaattagctgggcatg gtggtgggct cctgtaatcc 30660 cagctactca ggaggctgag gcagggcaattgcttgagcc gagatcacac tgcactccag 30720 cctgggcaac agggcaagac tctgtctcaaaaccaaaaaa aaaaaaatta ctggaggaac 30780 ctagaagaag aaatgatcaa ttttgcttggagtgtatcta gaaagacttc actgagatca 30840 tttaaagaac aaaaaggatg gctggggtccaggcagtggc tcatgcctgt aatcccagca 30900 ctttcggata ccaaggcagc agatcacctgaggtccagag tttcagacca gcctggccaa 30960 catagtgaaa ccccatctct actaaaaataaaaaaattag ctgagcatgt tggagggcac 31020 ctgtaatccc agctacttgg gaggctgaggcaggagaatc actcgaaccc aggaggtgga 31080 ggttgcagtg agccaagatc acgccactgcactccagcct gggcaacaga gtgagactct 31140 gtctcaaaaa acaacaacaa caaaaaatacaaacaagaga caagtagttc ccaggtgcct 31200 accaagtggt caggcactgc acttacctcactgactgcag taaccaccct ttgaggttgt 31260 ggcattgctc cattttccag gcaaggaaatgggctgagag ctgggattag tcaggtcatg 31320 actgtgtgtg ccactcccgc taaatctcatttgatgtggt tcatgaggcc acaccatgga 31380 cagcttcctc cttgtgtcca ctgaggatatggctttgtac aacactttgg tttttgaacg 31440 actttacaaa cctccctgtc ttgtgaggaaggaagaacag ttattaccat ctgcatctga 31500 tgatgaaaca agggacgctg cagaggagccgcactgacca ctccctccct ccagtcctgt 31560 catcccactg ccagtgtccc accctcttgtgccctgcact tcactggcta ataacccccc 31620 tcactttttc ctctgtgaag ccatcctggataattcccca cccacgaatg gtccctcctc 31680 atctcagaga gctctccatg cacacctgttaccgtttctg tctttatctg taaatatctg 31740 tgtgtctgac ttccatgcct cacacacctctatagggcaa agactgtctt aaacatcttg 31800 gtagtgtcag tattttgcac agtgaagtttttttttttaa attatatcag ctttatttgt 31860 acctttttga catttctatc aaaaaagaagtgtgcctgct gtggttccca tcctctggga 31920 tttaggagcc tctaccccat tctccatgcaaatctgtgtt ctaggctctt cctaaagttg 31980 tcacccatac atgccctcca gagttttatagggcatataa tctgtaacag atgagaggaa 32040 gccaattgcc ctttagaaat atggctgtgattgcctcact tcctgtgtca tgtgacgctc 32100 ctagtcatca catgacccat ccacatcgggaagccggaat tacttgcagg gctaacctag 32160 tgcctatagc taaggcaggt acctgcatccttgtttttgt ttagtggatc ctctatcctt 32220 cagagactct ggaacccctg tggtcttctcttcatctaat gaccctgagg ggatggagtt 32280 ttcaagtcct tccagagagg taagagagagagctcccaat cagcattgtc acagtgcttc 32340 tggaatcctg gcactggaat ttaatgaatgacagactctc tttgaatcca gggccatcat 32400 ggctctttga gcaaggcaca gatggagggaggggtcgaag ttgaaatggg tgggaagagt 32460 ggtggggagc atcctgattt ggggtgggcagagagttgtc atcagaaggg ttgcagggag 32520 agctgcaccc aggtttctgt gggccttgtcctaatgaatg tgggagaccg ggccatgggc 32580 acccaaaggc agctaagccc tgcccaggagagtagttgag gggtggagag gggcttgctt 32640 ttcagtcatt cctcattctg tcctcaggaatgtcccaagc ctttgagtag ggtaagcatc 32700 atggctggca gcctcacagg attgcttctacttcaggcag tgtcgtgggc atcaggtgag 32760 tgagtcaagg cagtggggag gtagcacagagcctcccttc tgcctcatag tcctttggta 32820 gccttccagt aagctggtgg tagacttttagtaggtgctc aataaatcct tttgagtgac 32880 tgagaccaac tttggggtga ggattttgttttttttcttt tgaaacagag tcttactctg 32940 ttgcctgggc tggagtgcag tggtgcaattttggctcatt ccaacctctg cctcccagat 33000 tcaagcgatt ctcttgcttc agcttcccaggtagctggga ttacaggcgg ccaccactac 33060 gcccagctaa tttttgtatt tttagtagagacggggtttc accatgctgg caaggcaggt 33120 ctcaaactcc tcacctcagg tgatccgcccacctcggcct cctaaagtgc taggattaca 33180 ggtgtgagcc cctgcgcccg gccaaggggtgaggaatttt gaaaccgtgt tcagtctctc 33240 ctagcagatg tgtccattct ccatgtcttcatcagacctc actctgcttg tactccctcc 33300 ctcccaggtg cccgcccctg catccctaaaagcttcggct acagctcggt ggtgtgtgtc 33360 tgcaatgcca catactgtga ctcctttgaccccccgacct ttcctgccct tggtaccttc 33420 agccgctatg agagtacacg cagtgggcgacggatggagc tgagtatggg gcccatccag 33480 gctaatcaca cgggcacagg taaccattacacccctcacc ccctgggcca ggctgggtcc 33540 tcctagaggt aaatggtgtc agtgatcaccatggagtttc ccgctgggta ctgataccct 33600 tattccctgt ggatgtcctc aggcctgctactgaccctgc agccagaaca gaagttccag 33660 aaagtgaagg gatttggagg ggccatgacagatgctgctg ctctcaacat ccttgccctg 33720 tcaccccctg cccaaaattt gctacttaaatcgtacttct ctgaagaagg tgaggaggaa 33780 ggggacaaga tgacatagag ccattgaaacttttcgtttt tcttttcttt ttttaaaatt 33840 tttttgaggc agaatctcac tctgcccattctgtcggcga gacaggagtg cagtggtgtg 33900 atctcccctc acagcaacct ctgcctcccaggctatagtg attctcctgc ctcagcctcc 33960 tgagtagctg gaattatagg cgtgcgccactaccacctgg ctaatttttg tatttttagt 34020 agagacaggg tttcatcatg ttgaccaggctagtcttaaa ctcctgacct caaatgatat 34080 acctgccttg gcctcccgaa gtgctggaattacaagtgtg agccaccgag cccagcagac 34140 acttttcttt tttctttttt tttttttgagacagagtctc gcactgtcac ccaggctgga 34200 gtgcagtggc acaatctcag ctcactgcaacctccacctc ccgggttcag gtgattctcc 34260 tgtctcagcc tctcgagtac ctgggattacaggtgcctgc caccacgccc ggctaatttt 34320 ttgtattttt agtagagaca gggtttcactatgttggcca ggatgattgc gaactcctga 34380 cctcgtgatc tgcccacatc ggcctcccaaagtgctggga ttacatgcgt gagccactga 34440 cacttttctt tgccctttct ttggaccctgacttctgccc atccctgaca tttggttcct 34500 gttttaatgc cctgtgaaat aagatttcgccgcctatcat ctgctaactg ctacggactc 34560 aggctcagaa aggcctgcgc ttcacccaggtgccagcctc cacaggttcc aacccaggag 34620 cccaagttcc ctttggccct gactcagacactattaggac tggcaagtga taagcagagt 34680 cccatactct cctattgact cggactaccatatcttgatc atccttttct gtaggaatcg 34740 gatataacat catccgggta cccatggccagctgtgactt ctccatccgc acctacacct 34800 atgcagacac ccctgatgat ttccagttgcacaacttcag cctcccagag gaagatacca 34860 agctcaaggt aggcattcta gctttttcaggccctgaggg ccctgatgtc tgggggttga 34920 gaaactgtag ggtaggtctg cttgtacagacattttgtcc cctgctgttt tgtcctgggg 34980 gtgggagggt ggaggctaat ggctgaaccggatgcactgg ttgggctagt atgtgttcca 35040 actctgggtg cttctctctt cactacctttgtctctagat acccctgatt caccgagccc 35100 tgcagttggc ccagcgtccc gtttcactccttgccagccc ctggacatca cccacttggc 35160 tcaagaccaa tggagcggtg aatgggaaggggtcactcaa gggacagccc ggagacatct 35220 accaccagac ctgggccaga tactttgtgaagtaagggat cagcaaggat gtgggatcag 35280 gactggcctc ccatttagcc atgctgatctgtgtcccaac cctcaaccta gttccacttc 35340 cagatctgcc tgtcctcagc tcacctttctaccttctggg cctttcagcc ttgggcctgt 35400 caatcttgcc cactccatca ggcttcctgttctctcggtc tggcccactt tctttttatt 35460 tttcttcttt tttttttttt tgagaaggagtctctctctc tgtcacccag gctggagtgc 35520 tgtggcgcca tcttcactca ctgtaacctctgcctcctga gttcaagcaa ttctcctgcc 35580 tcagccttcc aagtagctgg gattataggcgcctgccacc aggcccagct gatttttcta 35640 tttttagtag agacggggtt tcgccaggctgttctcgaac tcctgaactc aagtgatcca 35700 cctgcctcgg cttcccaaag tgctgggattacagtgtgag ccaccacacc cagctggtct 35760 ggtccacttt cttggccgga tcattcatgacctttctctt gccaggttcc tggatgccta 35820 tgctgagcac aagttacagt tctgggcagtgacagctgaa aatgagcctt ctgctgggct 35880 gttgagtgga taccccttcc agtgcctgggcttcacccct gaacatcagc gagacttcat 35940 tgcccgtgac ctaggtccta ccctcgccaacagtactcac cacaatgtcc gcctactcat 36000 gctggatgac caacgcttgc tgctgccccactgggcaaag gtggtaaggc ctggacctcc 36060 atggtgctcc agtgaccttc aaatccagcatccaaatgac tggctcccaa acttagagcg 36120 atttctctac ccaactatgg attcctagagcaccattccc ctggacctcc agggtgccat 36180 ggatcccaca gttgtcgctt gaaacctttctaggggctgg gcgaggtggc tcactcatgc 36240 aaacccagca ctttgggaag ccgaggcgggtgatcacctg aggtcaggag tttaagacca 36300 ccctggccaa cgtgttgaaa ccctgtgtctactaaaatac aaaaaaaaaa aattatctgg 36360 gcatgatggt gggtgtctgt aatcccagctactcaggagg ctgagaaggg agaatcagtt 36420 gaacccggga gatggtggtt gcggtgagccgagatcgcgc cactgcactc cagcctggga 36480 ggctgagcga gactccatct cgaaacaaaacaaaacaaaa ctatctaggc tgggggtggt 36540 ggttcatgta tgtatgtgta tatacatatatatgtgttta tatgtatata tatatacaca 36600 cacacacata catacacaca catacacacacaaattagct gggtgtggca cccgtgtagt 36660 cccagctact caggaggcta atgtgggaggatcagttgac cctaggaagt caaggctgca 36720 gtgagtcgtg attgcgccac tgtactccagcccgagtgac agagtgacat cctgtctcaa 36780 aaacaaaaaa aaatctcccc aaacctctctagttgcattc ttcccgtcac ccaactccag 36840 gattcctaca acaggaacta gaagttccagaagcctgtgt gcaaggtcca ggatcagttg 36900 ctcttccttt gcaggtactg acagacccagaagcagctaa atatgttcat ggcattgctg 36960 tacattggta cctggacttt ctggctccagccaaagccac cctaggggag acacaccgcc 37020 tgttccccaa caccatgctc tttgcctcagaggcctgtgt gggctccaag ttctgggagc 37080 agagtgtgcg gctaggctcc tgggatcgagggatgcagta cagccacagc atcatcacgg 37140 taagccaccc cagtctccct tcctgcaaagcagacctcag acctcttact agtttcacca 37200 aagactgaca gaagcccttc ctgtccagctttccccagct agcctgccct tttgagcaac 37260 tctggggaac catgattccc tatcttccctttccttcaca ggtctgcaca cctcattgcc 37320 ccttttgcaa ctactgaggc acttgcagctgcctcagact tctcagctcc ccttgagatg 37380 cctggatctt cacaccccca actccttagctactaaggaa tgtgcccctc acagggctga 37440 cctacccaca gctgcctctc ccacatgtgacccttaccta cactctctgg ggacccccag 37500 tgttgagcct ttgtctcttt gcctttgtccttaccctaga acctcctgta ccatgtggtc 37560 ggctggaccg actggaacct tgccctgaaccccgaaggag gacccaattg ggtgcgtaac 37620 tttgtcgaca gtcccatcat tgtagacatcaccaaggaca cgttttacaa acagcccatg 37680 ttctaccacc ttggccactt caggtgagtggagggcgggc acccccattc cataccaggc 37740 ctatcatctc ctacatcgga tggcttacatcactctacac cacgagggag caggaaggtg 37800 ttcagggtgg aacctcggaa gaggcacacccatccccttt tgcaccatgg aggcaggaag 37860 tgactaggta gcaacagaaa accccaatgcctgaggctgg actgcgatgc agaaaagcag 37920 ggtcagtgcc cagcagcatg gctccaggcctagagagcca gggcagagcc tctgcaggag 37980 ttatggggtg ggtccgtggg tgggtgacttcttagatgag ggtttcatgg gaggtacccc 38040 gagggactct gaccatctgt tcccacattcagcaagttca ttcctgaggg ctcccagaga 38100 gtggggctgg ttgccagtca gaagaacgacctggacgcag tggcactgat gcatcccgat 38160 ggctctgctg ttgtggtcgt gctaaaccggtgagggcaat ggtgaggtct gggaagtggg 38220 ctgaagacag cgttgggggc cttggcaggatcacactctc agcttctcct ccctgctccc 38280 tagctcctct aaggatgtgc ctcttaccatcaaggatcct gctgtgggct tcctggagac 38340 aatctcacct ggctactcca ttcacacctacctgtggcgt cgccagtgat ggagcagata 38400 ctcaaggagg cactgggctc agcctgggcattaaagggac agagtcagct cacacgctgt 38460 ctgtgactaa agagggcaca gcagggccagtgtgagctta cagcgacgta agcccagggg 38520 caatggtttg ggtgactcac tttcccctctaggtggtgcc aggggctgga ggcccctaga 38580 aaaagatcag taagccccag tgtccccccagcccccatgc ttatgtgaac atgcgctgtg 38640 tgctgcttgc tttggaaact gggcctgggtccaggcctag ggtgagctca ctgtccgtac 38700 aaacacaaga tcagggctga gggtaaggaaaagaagagac taggaaagct gggcccaaaa 38760 ctggagactg tttgtctttc ctggagatgcagaactgggc ccgtggagca gcagtgtcag 38820 catcagggcg gaagccttaa agcagcagcgggtgtgccca ggcacccaga tgattcctat 38880 ggcaccagcc aggaaaaatg gcagctcttaaaggagaaaa tgtttgagcc cagtcagtgt 38940 gagtggcttt attctgggtg gcagcaccccgtgtccggct gtaccaacaa cgaggaggca 39000 cgggggcctc tggaatgcat gagagtagaaaaaccagtct tgggagcgtg aggacaaatc 39060 attcctcttc atcctcctca gccatgcccagggtccgggt gcctggggcc cgagcaggcg 39120 ttgcccgctg gatggagaca atgccgctgagcaaggcgta gcccaccatg gctgccagtc 39180 ctgccagcac agataggatc tggttccggcgccggtatgg ctcctcctca gtctctgggc 39240 ctgctggtgt ctggcgttgc ggtggtacctcagctgaggg tcaaggaagg aaggtgtgtt 39300 aggagaacta gttcttggat ccctgcccactctccccagg gctgcccctc ccatctgccc 39360 cttacctcca tcccagggga agtagagactgagaatgtgg gtacaatagg cacagaggtt 39420 gtgcagccca cgcaggtgga cctgcagcttcccactgggc agctttgcct gcagcagcag 39480 ggccaagtag ctgaagacga aggcgtccaaggaggcaggg ctggagcaga gagagaaggg 39540 tgggatggag gagaaccact ggggtagaaggggtaaagat ggagctggag gaagagtcag 39600 ccttgggagg tgggctctgg gcagcaggcggccaccaggg aaggacagga cacacagttc 39660 tagacctggt atggggagag atccccaggtggcgccagct ggccctgaat agggctctat 39720 cccagggctg cataaagggc acactcagtgccccacagct cttcaggccc ttcctgtgcc 39780 tggctgccct cccaccctac ccttttgtacctctgagaag gctctggccc cacgcacagc 39840 cccactgtca ccagggccag tatctgtctcagggacctcc tatccagagc ctgagccagc 39900 cccagcccca gccccagctc cagctgctccatctgaacct gtatcttctt ccaagccacc 39960 cattaccctc ttggagtcag actcacgcatctccaaagaa gaacttttga gagcccaggc 40020 gctgagagag cagggtcaga cactcccgagcctctcggta cagctgtagg ggcgacacag 40080 gtaggcttgc agctgcggga acagtgccacctccgcacct aagcactccc attcctggcc 40140 agcatccttg gggctcatct catacaatagcccccggtct cagagctacc tccttctcca 40200 gctcttcctc gtcctcaggc ctgtgctccccagtcagcag ctgtagccgt tccatgtact 40260 gccgctgcat gcggccaggc aggaagaagttgaggggaaa gggcatagcc tctgcatacc 40320 acttccgggt cacttctacg tagttcttggtgtctatcca aaaagtatgt acctggattg 40380 ggtgggcagg aagaaacagg caggtctgagccagtgcacc tgtctgattc aaggtgggct 40440 tctgacctcc atgctctcct gagtctctgtgtgggtctgt gtgttcccgt cccctccccg 40500 gctggccatg gatgctggga ggtctgggcacactcaccag caccgggatc aacttctcct 40560 ccaggagaga catgaaggcc agggtgtctgccccttgctg agctgacaga tcataatcag 40620 cattgtactt ctgtggagga aatatccatggcgtggacgc tggggagctg caagggcact 40680 tcaccaggga ggaaggagtc ctgtctggtacccccctcac tggcctctga gtgcagtgga 40740 ggtacagcaa ggaacttttc ctgccaaggcccccttgcct gggcccagcc agtagcctgt 40800 tgctgttggc aaaaagcctg ggccttggagcccgctggcc gtcaaggtcc tgggcccatt 40860 gagaagaagg aagaaaggtt gggccgcaaactaggagcag ctcccagaat ttccatggaa 40920 agctggaaca atgcctgctg acagcaactttctaacagta actttcccga cccagacacc 40980 acaaagctag cacaacggag ctcagatgcaggctaggact cggtccatgc ctcaggaacc 41040 agggaaagcc atcctcacac tccctggatccagggaaccc acgcccaggg ccccccagct 41100 tgttccctca gtgcccagct cttggctatttctttcactt cattccatcg cccagacacc 41160 attaccacat acacattcca tccatacccccaggtctcag cctgccctac cttcccaggc 41220 tccagtccct gttcctcagc atcccccaccacatcctgag taagctttgt ccccagataa 41280 cctcttcagc atgatcctta aatctccctaagcctcagtt tctcccctgt ggaatggggg 41340 taagaatctc tttctctgaa tgcccctgtgttaggaaata atttagaata cttcggaaac 41400 aaaaagctct gttcacacct aagcaatcagggcagtggcc tggccttgcc aggaacttag 41460 gcttttatct ggatcctctt tccaggcctctcaattaatt ccccaggtcc ttaacctttg 41520 ggaaattaga aattaggaag agtgtcccacttctgacact gtgttccctc ttggaacctg 41580 accgtcaatg ctagaagaac ccttggaaaacatgctggcc cagccctcta gttttacaaa 41640 taagggagtg cacagccctg agaggttacatggcctgccc aagatcacgc agtcaatggc 41700 agagtaaaga gcatagccta ggcctccccactcctctagt aatgctcttt catcttctcc 41760 aacctggctc taagccttgt ccatcctgagccccatatct agcccaacct agtccctgaa 41820 aacaagaagt ggcccttaga aatctctctccagtcccact atcagaggcc aactgctgtc 41880 ttccagtctc cttcagcctg tgctcctctccctccctgac tgacaggcag aaggtaccgt 41940 gcctctggat atccccacag tgccctgagctgcatctctt gccgactgct ttaatacatc 42000 acagtgacat tgtgtgtgtc tctgccaccagactattgct ccttgatgct ctgggtcacc 42060 tgcatctagc atggcatata tctagtgctcaataaatgtg tattgtacgg aattgactga 42120 acttctctca ctggcagccc cctctatccaaatcacccac ctctttttga aggtgggtga 42180 tgatcttgtg tggtactgag gtgacctctccatgactggt ccaaagggca ggcagggttc 42240 ctgatactga gagaaaagta taccaaccaggaccttagct gcctatccat acgtagttgc 42300 aacacattcc tgcctatctc ctgcttctcctcttgtacac acccttcctc acccccaagg 42360 gatactgggt acctgaaggg ctctgccaggggttgctgat cttgtgctat atggtgcaca 42420 agaaactttt aagaaaaaag aaactttcaagacacacttt ccaaccacga attctatctc 42480 tgctcctttt catagcaata ggttttctttttcttccctc cacacttaaa catccattct 42540 cttatcaccc acccccatca gactccttcccctgtgtttc ttcagccact gctctgacca 42600 aaatttgagt gaccaaaagt ggtgtcagacccagtgacca tttctctgcc tgcatttcac 42660 ttgaccttga agcagcaatt aatctccataatcagcatct tcttgaattt ttccctttga 42720 ggacattgct cttctacttt gtgctctggttatcctttac aaacttttac acctctcctg 42780 aactggttaa cagtagaggg ccccaagggtctcacgtaag ccttctttgt ttttgttttt 42840 tttcttttct tttatttttt atatatttttttgagacgga gtcttgctct gttgcccggg 42900 ctggagtgca gttgcacgat ctcggctcactgcaagctct gcctcccagg ttcatgctat 42960 tctcctgcct cagcctcccg agtagctgggactacaggtg cctgccacca tgcctggcta 43020 attttctgta tttttaacag agacagggatacaccatgtt agccaggatg gtctcgatct 43080 cctgacctcg tgatctgccc acctcagcctcccaaaatgc tgggattaca ggcgtgagcc 43140 actgctccca gccgtcattt ttattttattttatttattt ttttgagatg gagtcttgct 43200 cttttgccag gctggagtgc atggtgcgatctcggctcat tgcaaccccc gcctcccagg 43260 ttcaagcgat tctcctgcct cagcctcctgagtggctggg actacaggtg cctgtcacca 43320 tgcctggcta attttctgta ttttagcttgagacacggtt ttcaccttgt tagccaggat 43380 ggtctcgatc tgtgagcctc gtgatctgcctgcgtcggcc tcccaaagtg ctgggattag 43440 caggcagtga gccactgcac ccggccatcaattttttttt tttttaaatg gagtcttgct 43500 ctgtcaccca ggttggagtg taacagtgcaatcttggctc attgcaacct ccgcctcttg 43560 ggttcaagcg attctccagc ctcagcctcctgagtagctg ggactacagg tgcatgccac 43620 cacacccggc tagtttttgt atttttagtagagacggggt ttcaccatgt tgtccaggat 43680 ggtctcaaac tcctgacctc aaatgatgtgcctgccttga ccttccaaag tgctgggatt 43740 agaggcgtga accacattgc ccaccgtaagccctcttttc tacttccact ctttctgtag 43800 gtggccctac tactaacgtc catggctttaagtaccatct ttctatgtgt taatgcataa 43860 gtccagcctt gacctctctt gagcgccatccaactcagca tatctggttg gatgtctaat 43920 gagtatttca aattcaacat ggccacaactgaactaactg aactcttttt tttttttagg 43980 cagagttttg ctcttgttgc ccaggctggagtgcaatggc gagatcttgg ctcactgcaa 44040 cctctgcctc cagggttcaa gcgattctcctgcctcagcc tcccgagtag atgggattat 44100 aggccccgct acccggctaa tgtttttgtatttttagtag agacagggtg ttgccatatt 44160 gaccaggctg gtctgcaaat cctgacctcaggtgatcccc ctgcctcggc ctcccaaagt 44220 gctggaatta caggcgtgag tcactgccccggccacacaa ctgagctctt cattcccacc 44280 ccagcaccgg ttatttttcc actgttcccatctcaatggc acctccatta cccatttgca 44340 cattccaaaa gcccaggaac catggtgacttcttttccca tatccaacac aaccaatcct 44400 atcttgaatt catccacgtc ccaccacctccccagctacc tagttccagc caccctctct 44460 ccacaacctc tgaatcaatc tttcacttttcccagcaatc cattctccac tcagcaaaat 44520 gatgataaag cacgtcacat caaggctctgcctcaattta atggcttccc attgtattta 44580 gaatcatctc caagctctca gagactatggtcagctacaa tctggcccac cttctgttcc 44640 agccaaattt cctcacagca caaggacgtttgcacctgct attttccaag cacgaaaccc 44700 tcgggcagat atatctggtg ctgtcacctaatttcaggtt ttaactccac tctcaccatt 44760 tcagtgagga cctaatcccc atcgcagtcattctatcaca tagctttatt ttattttatt 44820 ttattttttt ttgagataca gtctggctctctcacccagg ctggagtgca gtggtgtgat 44880 ctgggctcac tgcaaacttc catctcctgcgttcaagcga ttctcctgcc tcagcttccc 44940 gagtagctgg gattacaggt gtctgccatcacgcctgact aagttttgta ttttcagtag 45000 agacggggtt ttgccatgtt agccaggctggtctcgaact ccttgacctc aagtgatcca 45060 cctgcctcag cctcccacaa tgttggatttacaggcgtga gccactgctc ccggccacat 45120 agccatgttt taagcatgta tcactatctaaaattatttt ttgtttatac gtttgtgtcg 45180 tcctgtagaa tgtaaggtca caagatcagggacttgctca ttgcactggg tctaacacac 45240 agtgcttcaa caaacactcg ttaagatactaacgtggcag agtggggcct tgtaaacagt 45300 ttcaggaccc tgtgcttgta agagcaacgtggtgccctcc caaggaagac agggaggatg 45360 caggagcact gcccagagat ggcgtcaggctgcaagacat cttgaataat tcaccatcgt 45420 aacaacccag cctcaggaag agatggggcaaggccagaac gaaacattag gtaagaggcg 45480 gtggacaatg ggattcccac agggcagcttttgggcactg gacgttccct aacctgaggc 45540 tctctgaaga ggaaggttag gaatcctctcagcttcggtg ggctggactc actgtgggaa 45600 ttcaatcgcc cccccccacc cactaagggtgtgctggcgg gaaagcgctg agacgcatgc 45660 gtagttctcg cgtctggcac ccgctccctttccaatacgc ttgcgccccg tctgtgctac 45720 ggatggtcag ggagagttgt ccgtcttcaaatggaccaat gagacttgtg gaggggctct 45780 gagtcccgcc tctggatgag tgaccgtctcttttccaagt gaggcccgcc cgcctcccca 45840 gggctgcttt tctcgcggca cgggtggtggggctgcttct tgacttccgc gcctagtgcg 45900 cagggcccca ttctccagtc ccgcccacgcgcctttggag gctgcggtgg gatttccttt 45960 tgccttcggt tggggctgct gtttctcttcgccgacggta ggcattataa atatttcgcc 46020 ctttgaattt tagctctccc ctcccggcgttcgcacttag cctttttcat cattatcatt 46080 atttttaatg gtaaggatgg tcaaattttactaatgaagc gattataaaa tcttcaagtc 46140 tttgtatcca cttgcttttt gagggctggagtgggaagaa aggtatataa ttcattcatt 46200 cttcggacat gtgacaaacg ttcacggagcgcggcaacga gcgccggtgt cgcgatgcgc 46260 actggggctg cacatgggag ccaggatggactggactggt ccctgccctg cccgctgacg 46320 attggcaggc cactgccttt gatgagctgggcgctataac tgccattaag ccatttgtac 46380 aattaatcac aacagtgata agagcgacaaaggagctatt cgggtggctg aggcggagga 46440 tcttttgagc ctaggagttc gagaccagcctggggaatag agcgagatcc tgtctcaaaa 46500 aaagggggag aaaaagtgac aaagaagttgtgagggctat gagtttgcgt tagaggaggt 46560 gtgtgaggtg gggaggcggc agggggcgcggttgttctta gagaaagtga catccgggct 46620 aattttgaaa gaataagtgt taactaggctaagcggtggg aaagagtagt gtgtgctgaa 46680 ggaagggaag aggaacctgg caagttcatggcaggcctgt aattggatgc tggtctccct 46740 ccacacttga ctcctacagt ctgatttctacacagcaccc aaagtgatct cttaaaaata 46800 cacatgtgat cttgtcactt cccagcaccaaactctacag tgaatttcca tcttagaaca 46860 aaattcagac tccttaccat ggccaccaagaccctacaca atctggcctc ccaatttcct 46920 tttccaaggt caccttttac cactatccatctcactcaca ctgctccagc actctctatg 46980 cttttatttt tctttcttcc tctttctttctttctttctt tttctttttc tttctttctt 47040 tccttccttc tctctctctc tctttcttttcttttctctt tctttttttt ttttttgaga 47100 tgcagtcttg ctctgtcacc caggtgtgtgatcttggctc actgtcaacc cagagcagtg 47160 ggggtgatct tggctcactg cagcctccgccttccaggtt caagcaattc tgcctcagcc 47220 tcctgagtag cagggattag cgccatcagtcccagctaat ttttgtattt ttagtagaga 47280 tttcatcatg ttgtccaggc tggtctcgaactcctgactt caagtgatct gcccatctca 47340 gcctcctaaa gtgctgagat tacaggcaggagccaccaca cctggctcag tatttgtttt 47400 atttttgttg tgtttattta tttagagacaaggtcttgct ctgttgctca ggctggagtg 47460 ctgtggcagg aacacagctc actgcagcctccactacctc aattcaggcg atcctcccac 47520 ctcagctgag actacaggtg tgcatcaccatgcctggtta attttttcgt tttaccatgt 47580 tggccaggtt tgtctcgaac tcttgggctctgctgtcctc ctaccttagc ctcccaaagt 47640 gctaggattg taggcgtgag ccactgtgcccagctggtgt tcagtatttg aatccatatt 47700 tcctgtagcc gcaaccaaag ttccactgttaggtctcact ttgactttaa taattgtgtt 47760 caggctgggc acactggctc aagcctgtaatctcagcagt ttgggaggcc gaggtgggtg 47820 gatcacatga ggtcaggtgt ttgagaccagcctgtccaac atggcgaaac cctgtctcta 47880 ctaaaaatac aaaacattct ccgatcgtagtggcgggcgc ctgtaatccc agctacttgg 47940 gaggctgaag gaggagaatt gcttgaacctgggaggcaga ggttgcagtg agcggagatc 48000 acgtcattgc actccagcct gggctacagagcgagactct gtctcaataa gtaaataaat 48060 aaataaataa ataaataaat aaataaataattgtgttcgg agtcagcatc attcttcctg 48120 aagttcacca ctcctttgcc aagaacacttcctgaaacac tgacaagcag gaccttgaat 48180 aatggggtat ggttggtaac aactcactcattcaacaaac attgagcacc tattttgtgc 48240 ttgcctctaa atgaagagct ggttgtattatttatttttt taggtgacag ggctttccct 48300 atgttgccca ggttcgtctc aaactcctgggctcaaaaga tcctcatctt ctcaagtggt 48360 tgaatataca cgctccagcg accatgcctggctgaatgaa gagctttgag attttgaaga 48420 aacaggaacc atgaaatttg ctttgcaactgtttgcaacc tttaaggaag actgaaaagg 48480 cattcctgaa gcatgtgaga agcagtctgtgtgacctgat gactcagaac tgcttggaat 48540 ttagattagg acagatatga gcttaggcttcactctgcca catatttaac ttctctaagt 48600 cttagttttc ttttcttttt ttttttttttgagatggagt ctcgctctgt cacccaggct 48660 ggagtgcagt ggcacgatct tggctcactgcaagctccgc ctcccgggtt cacgccatcc 48720 tcctgcctca gccacccgag tagcggggactacagtcgca caccgccacg cctggctaaa 48780 tttttgtatt tttagtagag acgggtttcaccgtgttagc caggatggtc tcgatctcct 48840 aaccttgtga tccgcccgct tcggcctcccaaagtgttgg gattacaggt gtgagccatc 48900 gcgtctggcc tctaagtctt aattttcttatctgtaatgt agagttgtca ggctagtgca 48960 tgtaataagc actcatgaaa gactgactattatcttgcga aaaattggaa gagatcatat 49020 gaggtacaag gaccttccat ctgcactgggctgtacattg aatgttgtgg ttactgttga 49080 agcattggta aatcacatga atccaatggtaaaaccacac cctaaggtca gactcagtgg 49140 ctcacgcctg taattccagc actttgggaaaccaaggcaa gaggattgct tgagctcagg 49200 agttcgatac cagtctaggc aacctgtttctataaaaagt taaaaaatta gcttggtgtg 49260 gtggtgtgca cttctggtcc cagctactcaggaggctaag gtgggaggat cccttgagcc 49320 caggtggtcg aggctgcagt gagccaggatcacaccattg cattccagcc tggatgacag 49380 tgtgaggccc tgtcttaaaa aagacaaaaacaaaccaaaa aaacccacac cctagtgggt 49440 aaggggcagc agaggtccca cccaagagtgaaatcatttt tggtgtcagt aatcagggag 49500 agtaatatac cctcaggcca gaaactagaggtgagtgatg gctcacgcct ataatcccag 49560 caccttggga ggctgaggca ggtggattgcttgacctcag gaattcgaga ccagcctggg 49620 caacatagga agaccccatc tctaaaaataaatttcaaaa attagttggg catggtggca 49680 tgtacctgta gtcccagcta ctatggaggctgaggtggga ggatccaatt gagcctgaga 49740 ggtcaaagct gcagtgaatt gtgattgtgccaagaccctg tctctaaaat aaaataaaat 49800 tacaattaaa aactagagct gaaaggacaggttctaggtt aactggtagt agttgttcat 49860 tcatctgtat aaaaagtact tttttgagcacctactctgt gctgtgcatg gaatgaacaa 49920 atgtctttat ttgacttgaa cacagaccatagaagtaact aacagtggaa ctgggagtgg 49980 cactctccat gggacctaaa gacctaggagcctgtgatat gatacctgtc agtgtgaaag 50040 ggctggagtt tgctatctga gccagacctacactccaaga caggtcttgc tgggatggga 50100 tcaaaattgc tgctgggtca gccgaactatttttgtactt ctgcttttca cttatattta 50160 tgtcttccct aggtatatag ctgtctttttttgtttgctt gtttttctga gtcagagtct 50220 tgctctgtca ctcaggctgg agtgcagtggcgcgtctcgg ctcactgcaa catccacctc 50280 gggttcaaac gattctcctg cctcagattcccgagtagct aggattacag gtgcccacca 50340 ccatgcccgg ctaatttttt gtatttttaatagagatggg gtttagtaga gacagggttc 50400 actgtgttat ccaggatggt ctccatctcctgacctcgtg atctgcccgc ctcagcttcc 50460 caaagtgctg ggattacagg catgagccactgcgcccggc cactggagac cccttttcta 50520 ccaaaaaaaa taaaaataaa taaataaatatatatatata tttgtaaaga cggagttctc 50580 tatgttgccc aggctggtct tgaactcctggcctcaaggg atcctcttgt ctcagcttct 50640 taaaatgatg gggttacagg catgagccaccgtaccaggc cttagcaaac tctttttcaa 50700 gtgctataca aaggggacag aggaacgtgattgtggccac ctagtatgcc cctattggcc 50760 actgcagtga ggcctaggtg tttgaagagaggcacctagg gtagtggctc tcaggcacac 50820 cctaggggca ttttggaaat ttgtggggcatctttaatag agacaatgat taggaggcac 50880 tttgagaatt tagtggatgg agcccaggaaaggtagacat tctgcaacat gtagtacatt 50940 catacacaag gaagaattca ctcatttcacctgacttaca aatataaaat caaatataaa 51000 tcagtaggac tttattataa aatattgcagaaaagtttca caagaatttt ttgtagaaaa 51060 agcccaatca gggccgggca tggtggctcatgcctgtaat ctcaggactt tgggaggccg 51120 aggcgggtga atcacctgag gtcaggagtttgggacaagc ctggccaaca tggtaaaacc 51180 ccgtcttgac taaaaataca aaaatcagccaggcatggtg gtgtgcgtct gtaatcccag 51240 gtactcggga ggctgaggca ggagaatcgcttgaacccag gaggcggggt ttgcagtgag 51300 ccaagatcaa gccactgtgc tccagcctgggtgaaagagc aagactccgt cgaaagaaag 51360 aaaggaaggg aggaagggag gaagggagggagggagggag ggagggaaag aaagaaaaaa 51420 gagaagagag gggaggggag ggaagaggagaaaagaagga aggaaggcag gaaggaagga 51480 atccagtcga gatcgttaaa tttgttgtaattttgtcttt ggcacaataa acattagtcc 51540 ttattaaaac taactttact tagaaaataaaatagaaaat cctcaaggac aaatcacatt 51600 gtacaggaaa tatgaaatgt gaaataatatccaccttgga aatttttttt ttttgaaaca 51660 gagtgttact ctgtctccca ggctggagtgcagtggcaca atcttgggtc tctgcaacct 51720 ctgactcctg ggttcaagtg attctcgtacctcagcttcc caagtagctg ggattacagg 51780 cgtgcaccac cacacccagc taatttttgtattttcattt cagatgggat tttgccatgt 51840 tggccatgaa cacctgtcct caagctatccactgcctcag cctcccaaag tgctgggatt 51900 ataggcatga gctaccgtgc ccagccaccctggaaaaatt ttacaaaatg aagtataaaa 51960 agaagaaaag aggggcgggc atggtgccttgtgcgtgtaa tcctagaact ctgcaaagcc 52020 gaggcaggag atcctttggg tttaggagtttgagaccaac ctgcacaaca tagcaagacc 52080 ctatttctac aaaaaataca gacactaggctgggcgtggt ggctcacgtg taatcccagc 52140 actttgggaa gctgaggcca gcagatcacgaggtcaggag atggagacca tcctggctaa 52200 catagtgaaa ccctgtctct actaaaaatacaaaaaatta gccgggtgtg gtggcaggcg 52260 cctgtagtcc cagctactca ggaggctgaggcaggagaat ggcgtgaatc cgggaggcgg 52320 agcttgcagt gagccgagat cgtgccactgcactccagcc tgggagacag agcaagactc 52380 cgtctcaaaa aaaaaaaaat tttgctcagtacctggccaa aaaagaagca gctcactccc 52440 tgtacacaga ggggtaagag aaaggagatggttgaacttc taaactcgct aaagcaggag 52500 aggcaaatgt ggaatgtgct caggaaatatctgtgagatg aatgaatttg agggaagtaa 52560 ggtactagat aattacctgc cctacccagaacaaatcctg tgcaacgttt ccttgaagag 52620 caggaagtca ggccgggtgc tgtgctcacgcctgtaatcc cagccctttg ggaggccaaa 52680 gtgtgcgaat cacctgaggt caggagattgagaccagtct ggctaacatg gtgaaacccc 52740 atctctacta aaatacaaaa attagccgggcgtggtggtg cgtgcctgta gtcccaacta 52800 cttgggaggc tgaggcagga gaattgcttgaacctgggag gcagaggttg cggtgagctg 52860 agatcggcca ctgcactcca gactgggtgacagagtgaga caacatctca aaaaaacaaa 52920 aaaaaaagag aaaagcagga agtctggaaggggtggctac tgacatagtg aagcaactag 52980 ttcaattcta caacttgaca actacccctgtgccaggctg tctacaagga tatttagaat 53040 gtgtaagaca ttccttcaag gaactccaggaacagaggcc tgacatgttg caatgtttag 53100 tgtcaagcag tgtactagag acacattatcacactcaaac ctcacaacag ttctatgagg 53160 taggagttat cactcccctt ttatagatgaaatagaggct tagagtgatt gatttactga 53220 aggtcaaaca gccagtaaat ggtgtagcaaggattccaac cttgccgtct cactaaaact 53280 gtacaaaaaa agatacaaac aacagacaaatagttcccag gcgcctccca agttgccagg 53340 cactgcattt acctcactga cccctttgaggttgtggcat tgcctccatt ttctaggtga 53400 ggaaataggc tgagagctgg ggttagtctggtcatgactg tgtgtgccac tcccaccaaa 53460 tctcatttga tgtggttcat gaggcaaatggcatggacaa cttccttcac atgtccacta 53520 agcatatggc cttttacaac actttctggtttctgaacta ctttaaaacc tcactgtcct 53580 gtgaggaagg aagaacagtt attacaatctgcatctggaa gccaattgcc ctttagagat 53640 atggctgcaa ttgcctcact gcctgtgtcatgtgactctc cgaggccctt aatgagtaaa 53700 tgaggggtgc tgcaggggag ccaagctgaccactcccctc cctccagtcc tgccacccca 53760 ctgccagtgt cccaccctcc ttgcgccctacacttcactg gctaataacc cccctcactt 53820 tttcctgtgt tgaaggcatc ctggataattccccacccac gaatggtccc tcctcatctc 53880 agagagctct ccatgcacac ctgttactgtttctgttttt acctgtaaat atctgtgtct 53940 gacttccatg cttcatgcac ctctatagggcaaagactgt gtcttaaaca tcacggtagc 54000 ctcagcatgt tgtgcaatga aggtttttttgtttttgttc tttgtttttt ttttggtatt 54060 agctttattt gtatcatttt gaaatttttatcaaaaaagc agcgtgcctg ctgtggttcc 54120 catcctctgg gatttaggaa tctttacccgattctccatc caagtctgtc tttcgtattc 54180 taggctcttc ctaaagttgt cattcacatataccctccag aattttatag ggtgtataat 54240 ctgtaacaac tcggaggaag ccaattgccctttagaaata tggctgcaat tgcctcactt 54300 cctgtgtcat gtgactctcc tagtcatcacatgacccatc cacattggga agccagaatt 54360 acttgcagga gtaacctagt gcctatagctatggcaggta tcctgcatcc ttgttttttg 54420 tttagtggat cctctatcct tcagagactctggaacccct gtgctcttct cctcatctag 54480 tgaccctgag gtgatggagt tttcaagtccttccagagag gtaagagaga gagctcccaa 54540 tcagcattgt cacagtgctt ctggaatcctggcactggaa tttaatgaat gacagactct 54600 ctttgaatcc agggccatca tggctctttgagcaaggcac agatggaggg aggggtcgaa 54660 gttgaaatgg gtgggaagag tggtggggagcatcctgatt tggggtgggc agagagttgt 54720 catcagaagg gttgcaggga gagctgcacccaggtgtctg tgggccttgt cctaatgaat 54780 gtgggagacc aggccatggg cacccaaaggcagctaagcc ctgcccggga gagtagttga 54840 ggggtggaga gggacttgct tttcagtcattcctcattct gtcctcagga atgtcccaag 54900 ccttcgggta gggtaagcat catggctggcagcctcacag gattgcttct acttcaggca 54960 tgtcgtgggc agtcagatga gtgagtcaaggcagtgggga ggtagcacag agcctccctt 55020 ctgcctcata gtcctttggt agccttccagtaagctggtg gtagactttt agtaggtgct 55080 caataaatcc ttttgagtga ctgagaccaactttggggtg aggatttttg aaaccgtctt 55140 cagtctctcc aaacagctgt gtccgttctccacatccttg tcagacctca cctctgcttg 55200 tgctccctcc ctcccaggtg gtgcccctgcatccctaaaa gcttcagtac agctcggtgg 55260 tctgtgtctg caatgccaca tactgtgactcttgaccccc cgacctttcc tgccctaggt 55320 gccttcagcc gctacaagag cagaagcagtgggcattgga tggagctgag tacaggacca 55380 tacaggctaa ttgcaccggc acaggtaaccattacaccct tcaccccccg ggccaggctg 55440 ggtcctccta gaggtaaacg gtgtcagtgatcaccatgga gtttctccct gggcactgat 55500 aaccctgtgg atgtcctcag gcctgctactgatcctgcag ccagaagttc cagaaagtga 55560 agggatttgg aggggccgtg acagatgcaggtgccctcaa catccttgcc ctgtcacccc 55620 ctgcccagaa tttgctactt aaatggtacttctctgaaga agatgaggag gaaggggaca 55680 ggatgacata gagccactga cacttttctttgccaattct tgtggaccct gacttctgcc 55740 catccctgac atttggttcc tgtcttaatgccagtgaaat aagatttcgc cgcctatcat 55800 ctgctaactg ctacggactc aggctcagaaaggcctgcgc ttcacccagg tgccagcctc 55860 cacaggttcc aacccaggag cccaagttccttttggccct gactcagaca ctattaggac 55920 tggcaagtga taagcagagt cccatactctcctattgact cggactacca tatcttgatc 55980 atccttttct gtaggaatcg gatataacatcatctgggta cccatggcca gctgtgactt 56040 ctccatccgc acctacacct atgcagacacccctgatgat ttccagttgc acaacttcag 56100 cctcccagag gaagatacca agctcaaggtaggcattcta gctttttcag gccctgaggg 56160 ccctgatgtc tgggggttga gaaactgtagggtaggtctg cttgtacaga cattttgtcc 56220 cctgctgttt tgtcctgggg gtgggagggtgggggctaat ggctgaaccg gatgcactgg 56280 ttgggctagt atgtgttcca actctgggtgcttctctctt cactaccttt gtctctagat 56340 acccctgatt caccgagccc tgcagttggcccagcgtccc gtttcactcc ttgccagccc 56400 ctggacatca cccacttggc tcaagaccaagggagcgggg aatgggaagg ggccactcaa 56460 gggacagccc agagacatct accaccagacctgggccaga tacattgtga agtaagggat 56520 cagcaaggat gtgggatcag gactggcctcccctttggcc atgctgatct gtgtcccaac 56580 cctcaacctg gttccacttc cagatctgcctgtcctcagc tcacctttct accttctggg 56640 cctttcaaac ttggatctgt cagtcttgcccactccatca ggcttcctgt tctctcggtc 56700 tggcccactt tcttggctgg atcactcatgacctttctct tgccaggttc ctggatgcct 56760 atgctgagca caagttacag ttctgggcagtgacagctga aaatgagcct tctgctgggc 56820 tgttgagtgg ataccccttc cagtgcctgggcttcacccc tgaacatcag cgagacttca 56880 ttgcccgtga cctaggtcct acccttgccaacggtactca ccacaatgtc cgcctactca 56940 tgctggatga ccaacgcttg ctgctgccccactgggcaaa ggtggtaagg cctggacctc 57000 catggtgctc cagtgacctt caaatccagcatccaaatga ttggctccca aacttagagg 57060 gatttttcta cccaactatg gatcctagagcaccattccc cgggacctcc agggtgccat 57120 ggatcccaca gttgggactt gaaacctctctaggctgggg gtggtagctc atggctataa 57180 ttccagcact ttgggaaccc aaggtgggtggatcacttga acctaaggag ttcaagatga 57240 gcctgggaaa catggtgaaa ccctaactctacaaaaaaaa aaatagaaaa gttagccggg 57300 tgtggtggtg gcacgcctat agtcccaagtattctggagg ctaaggcggg aggtttagtt 57360 gagcctagaa tttcaggctg cagtgagctatgattgtgcc actgtactcc agcctgtgtg 57420 acagagggag accctgtctc aaaaacaaaaacaaaaaatc cctcccaaaa cctctgtagt 57480 tgcattcttc ccaccaccta attcaggattcctacaagag gaactagaag ttccagaagc 57540 ctgtgggcag ggtccagggt gacttgttcttcctttgcag gtactgacag acccagaagc 57600 agctaagtat gttcatggta ttgctgtacattggtacctg gactttctgg ctccagccaa 57660 agccacccta agggagacac accacctgttccccaacacc atgctctttg cctcagaggc 57720 ctgtgtgggt tccaagttct gggagcagagtgtgcggcta ggctcctggg atcgagggat 57780 gcagtacagc cacagcatca tcacagtaagccaccccagt ctcccttcct gcaaaggagg 57840 acctcagacc cattagtagt ctcaccaaagactgatagaa gcccttcctg tccagctttc 57900 cccaggtagc ctgccctttt gggcaactctggggaaccat gattccctgt cttgcctttc 57960 cttcacaggt ctgcacacct cattgccccttttgcaacta ctgaggcact tgcagctgcc 58020 tcagacttct cagctcccct tgagatgcctggatcttcac acccccaact ccttagctac 58080 taaggaatgt gccctcacag ggctgacctacccacagctg cctctcccac acgtgaccct 58140 tacctacact ctctggggac ccccagtgttgcgcctttgt ctctttgcct ttgtccttac 58200 cctagaacct cctgtaccat gtggtcggctggaccgactg gaacccatca ttgtagacat 58260 caccaagcac acgttttaca aacagcccatgttctaccac cttggccact tcaggtgagt 58320 ggagggcggg gcacccccat tccataccaggcctatcatc tcctacatcg gatggcttac 58380 atcactctac accacgaggg agcaggaaggtgttcagggt ggaacctcgg aagaggcaca 58440 cccatcccct tttgcaccat ggaggcaggaagtgactagg tagcaacaga aaaccccaat 58500 gcctgaggct ggactgcgat gcagaaaagcagggtcagtg cccagcagca tggctccagg 58560 cctagagagc cagggcagag cctttgcaggagttatgggg tgggtccgtg ggtgggcgac 58620 ttcttagatg agggtttcat gggaggtaccccgagggact ctgaccatct gttcccacat 58680 tcagcaagtt cattcctgag ggctcccagagagtggggct ggttgccagt cagaagaacg 58740 acccggacgc agtggcactg atgcatcccgatggctctcc tgttgtggtc gtcctaaacc 58800 ggtgagggca atggtgaggt ctgggaagtgggctgaagac agcgttgggg gccttggcag 58860 gatcacactc tcagcttctc ctccctgctccctagctcct ctaaggatgt gcctcttacc 58920 atcaaggatc ctgctgtggg cttcctggagacaatctcac ctggctactc cattcacacc 58980 tacctgtggc gtcgccagtg atggagcagatactcaagga ggcactgggc tcagcctggg 59040 cattaaaggg acagagtcag ctcacacgctgtctgtgact aaagagggca caacagggcc 59100 agcgtgagct tacagcgacg taagcccaggggcaatggtt tgggtgactc actttcccct 59160 ctaggtggtg ccaggggctg gaggcccctagaaaaagatc agtaagcccc agtgtccccc 59220 cagcccccat gcttatgtga acatgcgctgtgtgctgctt gctttggaaa ctgggcctgg 59280 gtccaggcct agggtgagct cactgtccgtacaaacacaa gatcagggct gagggtaagg 59340 aaaagaagag actaggaaag ctgggcccaaaactggagac tgtttgtctt tcctggagat 59400 gcagaactgg gcccgtggag cagcagtgtcagcatcaggg cggaagcctt aaagcagcag 59460 cgggtgtgcc caggcaccca gatgattcctatggcaccag ccaggaaaaa tggcagctct 59520 taaaggagaa aatgtttgag cccagtcagtgtgagtggct ttattctggg tggcagcacc 59580 ccgtgtccgg ctgtaccaac aacgaggaggcacgggggcc tctggaatgc atgagagtag 59640 aaaaaccagt cttgggagcg tgaggacaaatcattcctct tcatcctcct cagccatgcc 59700 cagggtccgg gtgcctgggg cccgagcaggcgttgcccgc tggatggaga caatgccgct 59760 gagcaaggcg tagcccacca tggctgccagtcctgccagc acagatagga tctggttccg 59820 gcgccggtat ggctcctcct cagtctctgggcctgctggt gtctggcgtt gcggtggtac 59880 ctcagctgag ggtcaaggaa ggaaggtgtgttaggagaac tagttcttgg atccctgccc 59940 actctcccca gggctgcccc tcccatctgccccttacctc catcccaggg gaagtagaga 60000 ctgagaatgt gggtacaata ggcacagaggttgtgcagcc cacgcaggtg gacctgcagc 60060 ttcccactgg gcagctttgc ctgcagcagcagggccaagt agctgaagac gaaggcgtcc 60120 aaggaggcag ggctggagca gagagagaagggtgggatgg aggagaacca ctggggtaga 60180 aggggtaaag atggagctgg aggaagagtcagccttggga ggtgggctct gggcagcagg 60240 cggccaccag ggaaggacag gacacacagttctagacctg gtatggggag agatccccag 60300 gtggcgccag ctggccctga atagggctctatcccagggc tgcataaagg gcacattcag 60360 tgccccacag ctcttcaggc ccctcctgtgcctggctgcc ctcccaccct acccttttgt 60420 acctctgaga aggctctggc ccccacacagtcacactgtc actagggcca gtttctatcc 60480 cagggacctc ctatccagag cctgagccagccccagcccc agccccagct ccagctgctc 60540 catctgaacc tgtatcttct tccaagccacccattaccct cttggagtca gactcacgca 60600 tctccaaaga agaacttttg agagcccaggcgctgagaga gcagggtcag acactcccga 60660 gcctctcggt acagctgtag gggcgacacaggtaggcttg cagctgtggg aacagtgcca 60720 cctccccacc taagcactcc cattcctggccagcatcctt ggggctcatc tcatacaata 60780 gcccccggtc tcagagctac ctccttctccagctcttcct cgtcctcagg cctgtgctcc 60840 ccagtcagca gctgtagccg ttccatgtactgccgctgca tgcggccagg caggaagaag 60900 ttgaggggaa agggcatagc ctctgcataccacttccggg tcacttccac gtagttcttg 60960 gtgtctatcc aaaaagtatg tacctggattgggtgggcag gaagaaacag gtaggtctga 61020 gccagtgcac ctgtctgatt caaggtgggcttctgacccc catgctttcc tgagcctgtg 61080 tgtgggtctg tgtgttcccg aaccctccccggctggccat ggatgctggg aggtctgggc 61140 acactcacca gcaccgggag caacttctcctccaggagag acatgaaggc cagggtgtct 61200 gccccttgcc gagctgacag atcataatcagcattgtact tctgtggagg aaatatccat 61260 ggcgtggaca ctagggagct gcaagggcacttcaccaggg aggaaggagt cctgtctggt 61320 acccccctca ctggcctctg agtgcagtggaggtacagca aggaactttt cctgccaagg 61380 cccccttgcc tgggcccagc cagtagcctgttgctgttgg cgaaaagcct gggccttgga 61440 gcctcctggc cgtgaaggtc cagcgcccaatgcagggaag gaaggaaggc tccgccgcaa 61500 actaggagca gctcccagaa tttccatggaaagctggaac aacgcccgct gacggcaact 61560 ttctaacagt aacttccccg acccagacaccacaaagcta gcacaacgga gctcagatgc 61620 aggctaggac tcggtccatg cctcaggaaccagagaaagc catcctcaca ctccctggat 61680 ccagggaacc cacgcccagg gccccccagcttgttccctc agtgcccagc tcttggctat 61740 ttctttcact tcattccatc gctcagacaccattaccaca tacacattcc acccataccc 61800 ccaggtctca gcctgcccta ccttcccaggctccagtccc tgttcctcag catccccctc 61860 cacatcctga gtaagctttg tccccagataacctcttcag catgatcctt aaatctccct 61920 aagcctcagt ttctcccctg tggaatgggggtaagaatct ctttctctga atgcccctgt 61980 gttaggaaat aatttagaat actttggaaactggaaaagc tctgttcaca cctaagcaat 62040 cagggcagtg gcctcggctc tgccaggaactttggctttt atctggatcc tctctttcca 62100 ggcctctcaa ttaattcccc aggtcctcaacctttgggaa gttagaaatg aggaagagtg 62160 tcctacttct gacactgttc cctcttggaacctgaccgtc aatgctagaa gaacccttgg 62220 aaaacatgct ggcccagccc tctagttttacaaataaggg agtgcacagc cctgagaggt 62280 tacatggcct gcccgagatc acatagtcaatggcagagta aagagcatag cctaggcctc 62340 cccactcctc tagtaatgct ctttcatcttctccaacctg gctctaagcc ttgtccatcc 62400 tgagccccat atctagccca acctagtccctgaaaacagg aagtggccct tagaaatctc 62460 tctccagtcc caccatcaga ggccaactgctgtcttccac tctccttcag cctgtgctcc 62520 tctccctccc tgcctcacag tgccctaagtttcatctctt gccgactgct ttaatacatc 62580 acagtgacat tgtgtgtgtc tctgccacaagactgttgct ccttgatgct ctgggtcacc 62640 tgcatctagc atggcatata tctggtgctcaataaatgtg tattgtatag aattgactga 62700 acttctctca ctggcagccc cctctatccaagtcacctac ctcttttcga aggtgggtga 62760 tgatcttgtg tggaactgag atgacctctccatgactggt ccgaagggca ggcagagttc 62820 ctgatattga gagggaagta taccaaccagacccttagct gcctagtcat acatagttgc 62880 aacacattcc tgcctatttc ttgcttctccccttgtacac acccttcctt acccccaagg 62940 gatactgggt acctgaaggg ctctgccaggggttgctgat cttgtgtacc ttcagtggag 63000 caccagtaaa tctggcatag gtctgcagggaaggaagcag aagtcagaga ggcagagcca 63060 tgtcccccac agtggtatca agaagagaaataacatttat tgaatacctt atgtgccatt 63120 ccctatactt agtatcttag tctcgaaaaaaaagggggag tcactaccat ttctatttta 63180 caaatggaca acacagggct cagagagattccggacatgt ctgtgattac aggacagcca 63240 ggaaaatcct ttcctctatc tcctctcttgccatctacct ggtgaacatc tatcctcaga 63300 agcttttcct aaccagtatc cctcttcttggaagcactga taatgccctt ctctttggca 63360 gtatctgtgg tctgttccta ctttaacttatatgtgaata ttttattata tttatttgct 63420 tacctatctg cctttctgta aactcgaagggaggaaccaa attttattca tctttgcctg 63480 gcaatgttta atatttggaa gaaatctaatagatcctctt taaagcaagg agtcaatgaa 63540 taaatgaagc aaaggaagtc tgattgctaagctctttata caaagcctgg ctgtgagtcg 63600 tttggtctaa caaacaatag cctcagtaaatgctgttaag tgaggaggaa gaggagaagg 63660 gggctgagga gaaggcagga attccaagctagagtaggtc cctgagacag tccagcctag 63720 taaacctccg aatggagaaa aaaaatgcccaaagagaaag gatctccgca aggtcacaca 63780 gccagtgaat ggatgagcaa ggactagaacccatcacctg aactcccaac caggcgtctt 63840 ttcattgctg cattaagcct agaaaactacagcatagggc actgaggcca tatcggccac 63900 gtacgctctt tgactgcctc agtttcccccactgcgcccg tgctgagtag ccctggcaag 63960 gtttggatgc ttggctagtt cggccccctccccagtcatc agggcacagc agagggcgcc 64020 ggcgccaccc ctcaccagca cggccaggctgtccaggtcc actgacggca gcccccagcc 64080 ccctgaccag cagaacagct ccatgggcgccgccatcttg cccaccctct gtcccggaaa 64140 cacttccttg tgtgcttctg ccctcccctgcctgggcccg ccccccgcca ccgcccctcc 64200 gatcgttgcg gtcagggggc ctggggagatccccggggag gcgaggcttc ttctggcccg 64260 actggcagct gaactgcggg ggactgggccgcgggcctcg ggggagggcg gccgccggcc 64320 catccagagg tggcccacgt agcgggacagcgctgtcggc ccggcgcgcc tcggagtgtc 64380 acggcgcctc gtccaagtgg agccccgaacccctgaaggc gcggcaggct ctggagagcg 64440 gggtcttgtg cgcctgggcc aggtctgggggctcttgcca aactgcacgt ggcctgtact 64500 gctccagggc cccttggggc tcgtccccgagcggggactg cgggggggtc ccccgagcag 64560 catgttttcc acagcgcgtt atgtttggagcgggccctgc gccgcctgtc gccatggaaa 64620 caaaacaggg gcggtggcgg cggccggagcggaggccggg ctggggcttg ggtgggggag 64680 gggaagagag gctcgcaggc tgtcgcttaggtgacgggaa ctcaggcgcc cctctgcttc 64740 atccgggtca cggcccgtcc gctagtacccacagtgttcc acagtctggt ccttggctcc 64800 tcgcctgtac ccctggtctt ctgcgcctgtccctggtgtc cctttcctct tttttggttt 64860 cttcactctg acctcactga ccactgctttagattctccc ttcagttccc gtcagacgct 64920 tccagactcc caagctttcc tacgaatgagggaaaatgga gaaacaggca cttgtcaggg 64980 gacccccacc ctaataaaga gcacttgctccgccagaaca gcaaaattca tgccatgtgg 65040 gcatccctgg gcactatagc aagctagttgcggccactcc cttggcatcc tttcctgcca 65100 gctgtggaat aatgcccact gtctagcactgcccctgcca ggggttcttg ccttccacaa 65160 tcgtggcttc cagaaaacag tggcattcggtagcgctgtg tgccgagacc cccaacaatg 65220 atgactgcgg agagggaggc acctggggggaggatcatta gggagaggta gaaagcaggg 65280 aggcctccag gattctttcc cagtgcccctggttcccaga gctgatgatg cctccagggt 65340 gattggcagc tctttgtttc agccccccctcacccgcctg ctggcccccc ctccaattct 65400 gtccctccgc cccccggctc ctgctctctccgcctagcct tttcccctcc cagctgcctg 65460 cctgccaggg gtagtgagcc ggctgagaggcatggagacg caggaacttc ggggggccct 65520 ggctcttctc ctcctttgct ttttcacatctgccagtcag gatctgcagg gtaagcctgt 65580 ctccatcctc ttagaccgct ctctgcttcttccccatttg ccctcagccc aagtagcaga 65640 gaacatgtgg gcaaggggag aggggaagagtccagaaatt gagccagagg aaaacttaag 65700 actgcctaga gttggtgaaa ttatggcctggtggagagga gtgggcaccg gagagtagtg 65760 ggggagctgg aacaggacag ggtcaggcatgaggccaggg cagaggactc aggaactgga 65820 tgctcaggct gcctggctgg ggtggttcctgagcatctgt aggcacccta gggtctggga 65880 gagcagcctg aaggtgggcg tactgtaggtccctgggctt cgtccccctg ggctcctggc 65940 tgcccaagga cagggcgggg gtggggaccaagaagcctgg gctctcccgg aggtctggtg 66000 gtgggatggg ccgatggatg tgggagaggtagctggagca gatgacacag gcatagattc 66060 tttactggct ggagggaaaa ccacagattggccaggacac aggaggcaga ggagctgggt 66120 cctacattcc cgtgggagat agcccagatggtgggagggt tagagtcctc agtgggcagc 66180 tgttcttatc cagagcgagt gtgctgcgcccctaggatgg gaggagggga actagggtgt 66240 ggaacgagcc aaacccagga gtggaaagagaagctgcctt ccatttctac gttgtggaca 66300 ccaggtgcca ctcctgtggg ggatcagcacagcatctcct ttgcgccacc tggtgggggc 66360 atctcaaatt tgtggggtgc tgtttctgtgtttccaagtg agtctcaaac ctgcaggttc 66420 ctggaaggta gtaataggca ataatggggaaaggaaggca cagtgcctct gtcctctgag 66480 agcaacaagg gttgaggttt ttttttgttttcgttttgtc tttttgagat ggagtttcac 66540 tcttgttgcc caggctggag tgcaatggcgcgatctcggc tcactgcaac ctccacctct 66600 caggttcaag cgagtctcct gcctcagcctcctgagtagc tgggattaca ggtgcccgcc 66660 accacgcccg gctaggtggt ttttttttcttttttgttgt tgtttttgta tttttagtag 66720 acacggggtt tcaccgtgtt ggccaggctggtctcgaact cctgacctta ggcgatccac 66780 ctgcctcaac ctcccaaggt gttgggattacaggtgcgag ccaccatgcc cagccaaggg 66840 ttgaattttg aggtgactaa cgctgcctggtggtgggggt agggggtgat gtgggctcgg 66900 ggagggcata ccttagcatt tgatcccattctctaacctt gataacccct gcctgaccag 66960 taattgacct gctgactgtg ggcgagtctcggcagatggt agctgtggca gagaagatcc 67020 ggacagcctt gctcactgct ggggacatctacctcttatc caccttccgc ctgcccccca 67080 agcagggtgg tgtcctcttt ggcctctattctcgccaaga caacactcga tggctggagg 67140 cctctgttgt aggcaagatc aacaaaggtgactggtgggc atctcctttc ttgcaatggt 67200 gacctcctta agcactctcc ctcatccctacttcccaagt gcttcctcat tctccttggc 67260 ctccattagg gaccaaggtc ccatcaggctgtgcccagac atcaggggca cctggtatgg 67320 ggtgcttata aggtgcttta ggagcccatgtaccaaacaa tagggtgagg cttgagtcct 67380 gatatttata gggatgggag gttgtacttggggatgcctg gaaggagggt gaggcatgtc 67440 tctgtgcctc agcggtgata cagaaaggtctttcagatcc ctggaacagt cagtggccac 67500 aaaaccagct gtcaccctag gagaggatacagtcatctga taccaatgaa gactaagaaa 67560 gtatttgtct ctaaaccaat tatttttttaaatggagtct tgctttgtca cctaggctgg 67620 agtgcagtgg tacaatctca gctcactgcaacctctgcct cctgggttca agcaattctc 67680 ctatctcagc ctcctgagta gctgggactacaggtgccca ccaccatgcc tggctaactt 67740 ttgtattttt agtagaaaca gggttttgccatgttgacca ggctgttctc gaacttctga 67800 cctcaagtga tctgcccacc ttggccttctaaagtgctgg gattacaggt gtgagccacc 67860 gtgcctggcc tctaaagcaa tttcaaaaacatttattgtg tacctattaa atacaaggtg 67920 ttgagagcca catagtagaa ttagggataggggccctaga gatgtggtat ctagcagagt 67980 tgatccaagg ctcagctggg aggcagggggtgggggatag cgcccatact gatctcccct 68040 cctgggcagt actggtgcga taccagcgggaggatggcaa agtccacgcc gtgaacctac 68100 agcaagcggg cctggctgat gggcgcacacacacagttct cctgcgactc cgaggtccct 68160 ccagacccag ccctgcccta catctctacgtggactgcaa actgggtgac caacatgcag 68220 gccttccagc actggccccc attcctccagcggaggtcga tgggctggag attaggactg 68280 gacagaaggc gtatttgagg atgcaggtgagccgggagga gcctctgagt tctgtggaaa 68340 tagagtttgc accagctggg gaaggggttggtagaccttc ccaccttact ctgctgctat 68400 ccccccaggg ctttgtggaa tctatgaaaattattctggg tgggtccatg gcccgggtag 68460 gagccctgag tgagtgtcca ttccaaggggacgagtccat ccacagtgca ggtaacacag 68520 agacttgttt gctgacattg gaccacggatcctgtggcct ttggtgactc ctgtcttctt 68580 gatctccctc tcccttaaaa cccattcctttggctcacct gttcctaggg tttctaaccc 68640 tgttattcca aatctttcac ctgactccacaatctccaat acacccatcc gagaaaaaaa 68700 gtgatctaag agaagaattg gttaattgcttggccttggc tgaccaagag atactggtct 68760 cgagtatttt tttttttttt ggtgatggagtcttgtcctg ttgcccaggc cggagtgcag 68820 tggtgcaatc tgggtcactg cagtctccactgagttcaag tgattctcct gcctcagcct 68880 cccaagcagc tgggattaca ggcggccctccaccatgcct agctaatttt gcatttttag 68940 tagagatggg atttcaccac gttggccaggctggtctcaa acacctaatc tcaagtgatg 69000 cacccacctc ggcctctcaa agtgctgggattacaggcgt aagccaccgc gcccggcctg 69060 gtctcgagtc ttttatagtt ctcactggcagctgtcacca gcaatttctc tgagtgttgc 69120 ccatctgcct ggtctctctg acagggatgttcagaagctc tcacctaaat aaaagaccca 69180 cctttcccag atatttgagg gagagctcttgaagaaggga atgggatggc gggtgtggtg 69240 gctcacacct gtaatcccag cactttgggaggctgaggtg ggctgatccc tcaaggtccg 69300 gagttcaaga ccagcctggc caacactgtgaaacctcctc tctactaaaa aatacaaaaa 69360 attagctggg catggtggtg ggcacctgtagtcccagcta cttgggaggc ggaggcagga 69420 gagttgcttg aacccaggag gtggaggttgtggtgagcag agatcacgcc actgcattcc 69480 agcctgggaa acagagcaag actccgactcaaaaaaaaaa aaaatgggga tatactgggg 69540 ccctggccct gctttgggtc catcccttctgccactacca tgcctaggaa ccaggaggat 69600 ttgggttcta acttcctgtg aagcaactcccttagagggc cttttgcccc atagaaggag 69660 ctggcactgc ttgtctgcca gctctgccctcccagcatcc agcaccccat ctttattctg 69720 gggctccagc cctgtccctg tcctcaccttccttcctcct tctcaccaac caggcctctc 69780 ttcacttcac ctcacccctc tgactatgttttcttctcct ctccagtgac caatgcactg 69840 cactccattc taggtgagta ggccacactgaagcggaagc ggggagcggg gaggaggccc 69900 caggctctgg cagctgcctg aaactaagtcctcttcagtc aggaatgtag taggtttaaa 69960 ggcaggggtg ggcagccgtg gcaggtactggcttattgcc catggagggc ccaggactgg 70020 tgctccagta ctgaacccct cacaccctgggtcccgacag gggagcagac caaggcgctg 70080 gtcacccaac tcaccctctt caaccagatcctggtggagc tgcgggatga tatacgagac 70140 caggtttggg tgggctggcg aagggtggcactgattctgg ggtagggtgg cagatgtcaa 70200 gtgctgactc ctccccatcc ttctccaggtgaaggaaatg tccctgatcc gaaacaccat 70260 tatggagtgt caggtgtgcg gtgagtgggagagcagggga ggctccacat gaccgtgcca 70320 cgttcccacc attggctttg gctttcccttctggtggctt aaatagtgac caccgggtag 70380 ctctgactgt gtccacccct caggcttccatgagcagcgt tcccactgca gccccaatcc 70440 ctgcttccga ggtgtggact gcatggaagtgtacgagtac ccaggctacc gctgtgggcc 70500 ctgcccccct ggcctgcagg gcaacggcacccactgcagt gacatcaatg aggtgaggga 70560 ggtcagagcc cagaagggta cagaaaactggggtgaggat gtcaggaggc acccaagagg 70620 gtgggataaa tgctggtccg gaggagaggaatctggagtt taggagaggt cagaggcaag 70680 agaaatgcaa gatgggagag acagaaggcctggggcaaag actgaaggcc atacagggaa 70740 ggagctgggg aaactgcagg ggaggcttgagatggcgacc agtggcatgg ggagggagag 70800 agggaacccc gagggaagtg gggtggggaccaggagcaca aggcagttgt gtggggagag 70860 ctgcacaaag gggagacctg gagcaatggttcctggatca caggcaggga cctgagtttc 70920 ccagagggcg gcctgacctc tgccttctcatctggtcccc agtgtgctca cgctgacccc 70980 tgtttcccgg gctccagctg catcaacaccatgcccggct tccactgtga ggcctgtcct 71040 cgagggtaca agggcacaca ggtgtctggtgtgggcattg actatgcccg ggccagcaaa 71100 caggtcagac tgggtagtgt gtgtggacaagggatcttgg ccttgtagag gccaggggct 71160 tctgggttgg acatgggacc cttgtgatgaatgggacatg gatggctttg catcggcttt 71220 gggtctgggc ttaatgttga tgttcacagataaggccata ccagtgtcct ctgctgtatc 71280 tgaggagacc caccataggt ctggttccacccaaagtctg ccccttcagg tctgcaatga 71340 catcgatgaa tgcaacgatg gcaacaatggtggctgtgac ccaaactcca tctgcaccaa 71400 cactgtggtg agctgaatat cctgagtgtattctggggtg gtgggaatgg taaaacccca 71460 acccccgccc cttcttacct tcaaatttcctgctgccttt cctcctcccc agggcactac 71520 acagattagg tgttcacatg gacttggtttggaatactgg tttgcctctt gctagctctt 71580 tagcaggtta tttaactttt ctgagcctcagcttcctcat ctgaaaactg aggctgttat 71640 ccaccctgaa aggttgtcat gataacctaaaggagatact gatatgcctg gcataactga 71700 gtgcccagcc cacactccgc aggtggagcagtgctggtcc tgggtgcctg gtgggctcac 71760 cctccttcct aagatgctgc ctctccactcttagggctct ttcaagtgtg gtccctgccg 71820 cctgggtttc ctgggcaacc agagccagggctgcctccca gcccggacct gccacagccc 71880 agcccacagc ccctgccaca tccatgctcactgtctcttt gaacgcaatg gtgcagtgtc 71940 ctgccaggtg agctaggctt caggcgtggaaggaaaaggg agggtctggg gaaaggagta 72000 gggctatgtt tagggcctgg gttgggggtcttcataggag agaaggtggg cctgggccca 72060 ggaactgttt ggtggggaga ataggacctgaagcagggaa aatataggga ggaggggagc 72120 cagaccaaac tgctagctcc taccctttgtgttgccctag tgtaacgtgg gctgggctgg 72180 gaatgggaac gtgtgtggga ctgacacagacatcgatggc tacccagacc aagcactgcc 72240 ctgcatggac aacaacaaac actgcaaacaggtgcaggga gcaggcgggc aagggggcgt 72300 agtggggagc ccaagctggg tcaggccagaactcatccat cctcttcccc tgaactttag 72360 gacaactgcc ttttgacacc caactctgggcaggaagatg ctgataatga tggtgtgggg 72420 gaccagtgtg atgatgatgc tgatggggatgggatcaaga atgttgaggt gactcccaga 72480 ctgccctgcc ccttgaagct cctctcccctcctcctgtcc tctctgtgcc tcacttacct 72540 catctggcag ctctctagta agggccaaatactccaaatc agggaggcaa aaacctcgtg 72600 cccaggacaa ggaggctggg tgggtgggactgtactgagc agtttgtcca tcacaagggt 72660 gtgatcttag aaaaggatac agagacaaggtaggtgcaag atgacaagtg atttagggga 72720 gacctgaccc ttcccctccc accctctgcccaggacaact gccggctgtt ccccaacaaa 72780 gaccagcaga actcagatac agattcatttggtgatgcct gtgacaattg ccccaacgtt 72840 cccaacaatg accagaagga cacagatggcaatggggaag gagatgcctg tgacaacgac 72900 gtggatgggg atggtgcagg cctggggctgaaggggtggc tgggggacct gtgagaattt 72960 ggatcaggtg gggatgaagc agggaagctaggaagtctct gtgaaatagg gaggcaggct 73020 tgtggacgtt ggcctgggtg aggagagattacctgcagca gatgtcaata ggaatgtgag 73080 gtagggcgta gtgttaggca gagtgtggactagagggtga gacaagaaac aggcagattt 73140 cctggccagt tgtcctctgg gtggggagacaaagttcggg actttcacca acctagaaga 73200 gagaatatgg catgttctag taacaaccttgtgctaccca tgtcttctag gcatccccaa 73260 tggattggac aattgcccta aagtccccaacccactacag acagacaggg atgaggacgg 73320 ggtgggagat gcttgcgaca gctgccctgaaatgagcaat cctacccagg tacagggaga 73380 tggtaaggac aggggaggga tgagggtactgatggatgaa gccccagccc tttggatgga 73440 aagtggtcag atcaccctct tcagagttatcaagaggaga tggtgagaac aggtccctct 73500 ctctcagaca gatgcagaca gcgacctggtgggggatgtc tgtgatacta atgaagacag 73560 gtaaggtctt ggtcaagaga cgcaaggtctttcttttttt tgtctttctg agacggcttg 73620 ctctgtcacc taggctggag tacagtggcacgatcttggc tcactgcaac ctccgtctcc 73680 cagggtcaag tgattctcat gcctcaacctccctgagtgg ctaggattac aggcatgtgc 73740 taccaagccc agctaattct tgtattttcagtagagacag ggtttcacca tgttggccag 73800 gctggtctcg aactccttac ctcaggtgatccgccagcct ctgcctccca aagtgctgtg 73860 attacaggtg tgagccactg tgccagcgaaatgcaaggtc ttatagggga tattttactt 73920 tcctctagta tatccttttt tttttttttttgagacggag tcttgctctg tcgcccaggc 73980 tggagtacag tggcatgatc tcggctcactgcaagctcca cctcccgagt tcacgccatt 74040 cttctgcctc agcctcccaa gtagctgggactacaggcgc ctgccaccac gcccggctaa 74100 ttttttgtgt ttttagtaga gacggggtttcactgtgtta accaggatgg tctcgatctc 74160 ctgatctcgt gatttgcctg cctcagcctcccaaagtgct gggattacag gcgtgagcca 74220 ccgtgcccgg cctccttttt ttttttgagatggagtcttg ctctgtcact caagctggag 74280 tgcagtgata tcggctcact gcgacctccacctccagggt tcaagcgatt ttcctgcctc 74340 agcctcccag tagctggatt acaggcgcgtgctaccaagc tcagctaatt tttgtgtttt 74400 tagtagagac agggtttcat cgtgttggccaggctggtct cgaactcctg acctcaggtg 74460 atctgcccgc cttggcctcc caaagtactgggaatacagg catgagccac tgtgcccagg 74520 ccatagtata ttctaaattc cttctatgatttagccttaa ttctctattg ctattcagca 74580 ggaatttatt cttacagtca tccctccattcctactgcca gaccttcacc tcacctggct 74640 gactgccagg ggtctgattg tatggagagcaggctccaga gactcccagg cagaagcgaa 74700 gggaaggaca aggagtacct ttaaatcctttatttggtac ctgttcttct gattagcgat 74760 ggggatgggc atcaggacac caaggacaactgcccacagc tgccaaatag ctcccagctg 74820 gactctgata acgatggact tggagatgagtgtgatgggg atgatgacaa tgatggcatc 74880 ccagattatg tgcctcctgg tcccgataactgccgcctgg tacccaatcc caatcagaag 74940 gactcagatg gtaagcctgc ggacccagagcacgttagac tggtgttgcc tttgcccagg 75000 tggaggcagc aagccctgtt gggaagtgaggaagggcaag gtgggaaaga tgtcaggaat 75060 gcaggcccaa cagatggtat tgttgcctaatggcagtggc cagggccttc ctgagcaccc 75120 agcctcactc tgcccaggca atggcgttggtgatgtgtgt gaggatgact ttgacaatga 75180 tgctgtggtc gaccccctgg atgtgtgtcctgaaagtgca gaggtaacgc ttacggattt 75240 tcgggcctat cagaccgtcg tcctggatcc75270 2 1500 DNA Homo sapien 2 atgccgcatc ctcgaaggta ccactcctcagagcgaggca gccgggggag ttaccgtgaa 60 cactatcgga gccgaaagca taagcgacgaagaagtcgct cctggtcaag tagtagtgac 120 cggacacgac ggcgtcggcg agaggacagctaccatgtcc gttctcgaag cagttatgat 180 gatcgttcgt ccgaccggag ggtgtatgaccggcgatact gtggcagcta cagacgcaac 240 gattatagcc gggatcgggg agatgcctactatgacacag actatcggca ttcctatgaa 300 tatcagcggg agaacagcag ttaccgcagccagcgcagca gccggaggaa gcacagacgg 360 cggaggaggc gcagccggac atttagccgctcatcttcgc agcacagcag ccggagagcc 420 aagagtgtag aggacgacgc tgagggccacctcatctacc acgtcgggga ctggctacaa 480 gagcgatatg aaatcgttag caccttaggagaggggacct tcggccgagt tgtacaatgt 540 gttgaccatc gcaggggtgg ggctcgagttgccctgaaga tcattaagaa tgtggagaag 600 tacaaggaag cagctcgact tgagatcaacgtgctagaga aaatcaatga gaaagaccct 660 gacaacaaga acctctgtgt ccagatgtttgactggtttg actaccatgg ccacatgtgt 720 atctcctttg agcttctggg ccttagcaccttcgatttcc tcaaagacaa caactacctg 780 ccctacccca tccaccaagt gcgccacatggccttccagc tgtgccaggc tgtcaagttc 840 ctccatgata acaagctgac acatacagacctcaagcctg aaaatattct gtttgtgaat 900 tcagactatg agctcaccta caacctagagaagaagcgag atgagcgcag tgtgaagagc 960 acagctgtgc gggtggtaga ctttggcagtgccacctttg accatgagca ccatagcacc 1020 attgtctcca ctcgccatta ccgagcaccagaagtcatcc ttgagttggg ctggtcacag 1080 ccttgtgatg tgtggagtat aggctgcatcatctttgaat actatgtggg attcaccctc 1140 ttccagaccc atgacaacag agagcatctagccatgatgg aaaggatctt gggtcctatc 1200 ccttcccgga tgatccgaaa gacaagaaagcagaaatatt tttaccgggg tcgcctggat 1260 tgggatgaga acacatcagc tgggcgctatgttcgtgaga actgcaaacc gctgcggcgg 1320 tatctgacct cagaggcaga ggaacaccaccagctcttcg atctgattga aagcatgcta 1380 gagtatgaac cagctaagcg gctgaccttgggtgaagccc ttcagcatcc tttcttcgcc 1440 cgccttcggg ctgagccgcc caacaagttgtgggactcca gtcgggatat cagtcggtga 1500 3 499 PRT Homo sapien 3 Met ProHis Pro Arg Arg Tyr His Ser Ser Glu Arg Gly Ser Arg Gly 1 5 10 15 SerTyr Arg Glu His Tyr Arg Ser Arg Lys His Lys Arg Arg Arg Ser 20 25 30 ArgSer Trp Ser Ser Ser Ser Asp Arg Thr Arg Arg Arg Arg Arg Glu 35 40 45 AspSer Tyr His Val Arg Ser Arg Ser Ser Tyr Asp Asp Arg Ser Ser 50 55 60 AspArg Arg Val Tyr Asp Arg Arg Tyr Cys Gly Ser Tyr Arg Arg Asn 65 70 75 80Asp Tyr Ser Arg Asp Arg Gly Asp Ala Tyr Tyr Asp Thr Asp Tyr Arg 85 90 95His Ser Tyr Glu Tyr Gln Arg Glu Asn Ser Ser Tyr Arg Ser Gln Arg 100 105110 Ser Ser Arg Arg Lys His Arg Arg Arg Arg Arg Arg Ser Arg Thr Phe 115120 125 Ser Arg Ser Ser Ser Gln His Ser Ser Arg Arg Ala Lys Ser Val Glu130 135 140 Asp Asp Ala Glu Gly His Leu Ile Tyr His Val Gly Asp Trp LeuGln 145 150 155 160 Glu Arg Tyr Glu Ile Val Ser Thr Leu Gly Glu Gly ThrPhe Gly Arg 165 170 175 Val Val Gln Cys Val Asp His Arg Arg Gly Gly AlaArg Val Ala Leu 180 185 190 Lys Ile Ile Lys Asn Val Glu Lys Tyr Lys GluAla Ala Arg Leu Glu 195 200 205 Ile Asn Val Leu Glu Lys Ile Asn Glu LysAsp Pro Asp Asn Lys Asn 210 215 220 Leu Cys Val Gln Met Phe Asp Trp PheAsp Tyr His Gly His Met Cys 225 230 235 240 Ile Ser Phe Glu Leu Leu GlyLeu Ser Thr Phe Asp Phe Leu Lys Asp 245 250 255 Asn Asn Tyr Leu Pro TyrPro Ile His Gln Val Arg His Met Ala Phe 260 265 270 Gln Leu Cys Gln AlaVal Lys Phe Leu His Asp Asn Lys Leu Thr His 275 280 285 Thr Asp Leu LysPro Glu Asn Ile Leu Phe Val Asn Ser Asp Tyr Glu 290 295 300 Leu Thr TyrAsn Leu Glu Lys Lys Arg Asp Glu Arg Ser Val Lys Ser 305 310 315 320 ThrAla Val Arg Val Val Asp Phe Gly Ser Ala Thr Phe Asp His Glu 325 330 335His His Ser Thr Ile Val Ser Thr Arg His Tyr Arg Ala Pro Glu Val 340 345350 Ile Leu Glu Leu Gly Trp Ser Gln Pro Cys Asp Val Trp Ser Ile Gly 355360 365 Cys Ile Ile Phe Glu Tyr Tyr Val Gly Phe Thr Leu Phe Gln Thr His370 375 380 Asp Asn Arg Glu His Leu Ala Met Met Glu Arg Ile Leu Gly ProIle 385 390 395 400 Pro Ser Arg Met Ile Arg Lys Thr Arg Lys Gln Lys TyrPhe Tyr Arg 405 410 415 Gly Arg Leu Asp Trp Asp Glu Asn Thr Ser Ala GlyArg Tyr Val Arg 420 425 430 Glu Asn Cys Lys Pro Leu Arg Arg Tyr Leu ThrSer Glu Ala Glu Glu 435 440 445 His His Gln Leu Phe Asp Leu Ile Glu SerMet Leu Glu Tyr Glu Pro 450 455 460 Ala Lys Arg Leu Thr Leu Gly Glu AlaLeu Gln His Pro Phe Phe Ala 465 470 475 480 Arg Leu Arg Ala Glu Pro ProAsn Lys Leu Trp Asp Ser Ser Arg Asp 485 490 495 Ile Ser Arg 4 1044 DNAHomo sapien 4 atggctcaga gcagagacgg cggaaacccg ttcgccgagc ccagcgagcttgacaacccc 60 tttcaggacc cagctgtgat ccagcaccga cccagccggc agtatgccacgcgtgacgtc 120 tacaaccctt ttgagacccg ggagccacca ccagcctatg agcctccagcccctgcccca 180 ttgcctccac cctcagctcc ctccttgcag ccctcgagaa agctcagccccacagaacct 240 aagaactatg gctcatacag cactcaggcc tcagctgcag cagccacagctgagctgctg 300 aagaaacagg aggagctcaa ccggaaggca gaggagttgg accgaagggagcgagagctg 360 cagcatgctg ccctgggggg cacagctact cgacagaaca attggccccctctaccttct 420 ttttgtccag ttcagccctg ctttttccag gacatctcca tggagatcccccaagaattt 480 cagaagactg tatccaccat gtactacctc tggatgtgca gcacgctggctcttctcctg 540 aacttcctcg cctgcctggc cagcttctgt gtggaaacca acaatggcgcaggctttggg 600 ctttctatcc tctgggtcct ccttttcact ccctgctcct ttgtctgctggtaccgcccc 660 atgtataagg ctttccggag tgacagttca ttcaatttct tcgctttcttcttcaatttc 720 ttcgaccagg atgtgctctt tgtcctccag gccattggta tcccaggttggggattcagt 780 ggctggatct ctgctctggt ggtgccgaag ggcaacacag cagtatccgtgctcatgctg 840 ctggtcgccc tgctcttcac tggcattgct gtgctaggaa ttgtcatgctgaaacggatc 900 cactccttat accgccgcac aggtgccagc tttcagaagg cccagcaagaatttgctgct 960 ggtgtcttct ccaaccctgc ggtgcgaacc cgagctgcca atgcagccgctggggctgct 1020 gaaaatgcct tccgggcccc gtga 1044 5 347 PRT Homo sapien 5Met Ala Gln Ser Arg Asp Gly Gly Asn Pro Phe Ala Glu Pro Ser Glu 1 5 1015 Leu Asp Asn Pro Phe Gln Asp Pro Ala Val Ile Gln His Arg Pro Ser 20 2530 Arg Gln Tyr Ala Thr Arg Asp Val Tyr Asn Pro Phe Glu Thr Arg Glu 35 4045 Pro Pro Pro Ala Tyr Glu Pro Pro Ala Pro Ala Pro Leu Pro Pro Pro 50 5560 Ser Ala Pro Ser Leu Gln Pro Ser Arg Lys Leu Ser Pro Thr Glu Pro 65 7075 80 Lys Asn Tyr Gly Ser Tyr Ser Thr Gln Ala Ser Ala Ala Ala Ala Thr 8590 95 Ala Glu Leu Leu Lys Lys Gln Glu Glu Leu Asn Arg Lys Ala Glu Glu100 105 110 Leu Asp Arg Arg Glu Arg Glu Leu Gln His Ala Ala Leu Gly GlyThr 115 120 125 Ala Thr Arg Gln Asn Asn Trp Pro Pro Leu Pro Ser Phe CysPro Val 130 135 140 Gln Pro Cys Phe Phe Gln Asp Ile Ser Met Glu Ile ProGln Glu Phe 145 150 155 160 Gln Lys Thr Val Ser Thr Met Tyr Tyr Leu TrpMet Cys Ser Thr Leu 165 170 175 Ala Leu Leu Leu Asn Phe Leu Ala Cys LeuAla Ser Phe Cys Val Glu 180 185 190 Thr Asn Asn Gly Ala Gly Phe Gly LeuSer Ile Leu Trp Val Leu Leu 195 200 205 Phe Thr Pro Cys Ser Phe Val CysTrp Tyr Arg Pro Met Tyr Lys Ala 210 215 220 Phe Arg Ser Asp Ser Ser PheAsn Phe Phe Ala Phe Phe Phe Asn Phe 225 230 235 240 Phe Asp Gln Asp ValLeu Phe Val Leu Gln Ala Ile Gly Ile Pro Gly 245 250 255 Trp Gly Phe SerGly Trp Ile Ser Ala Leu Val Val Pro Lys Gly Asn 260 265 270 Thr Ala ValSer Val Leu Met Leu Leu Val Ala Leu Leu Phe Thr Gly 275 280 285 Ile AlaVal Leu Gly Ile Val Met Leu Lys Arg Ile His Ser Leu Tyr 290 295 300 ArgArg Thr Gly Ala Ser Phe Gln Lys Ala Gln Gln Glu Phe Ala Ala 305 310 315320 Gly Val Phe Ser Asn Pro Ala Val Arg Thr Arg Ala Ala Asn Ala Ala 325330 335 Ala Gly Ala Ala Glu Asn Ala Phe Arg Ala Pro 340 345 6 2010 DNAHomo sapien 6 atgatgccct cgcctagtga ctccagccgc tcgctgacca gccggcccagcaccaggggc 60 cttacccacc tccgcctcca ccgaccctgg ctgcaggccc tgcttacgctggggctggtc 120 caagtgctcc tgggcatcct ggtggtcacc ttcagcatgg tggcctcttccgtcaccacc 180 accgagagca tcaagaggtc ctgcccgtct tgggctgggt tctcgctggcgttctccggg 240 gtggttggca ttgtgtcctg gaagcggcca ttcactctag tgatctccttcttctccttg 300 ctttcggtgc tctgtgtcat gcttagcatg gctggctctg ttctctcctgtaagaatgct 360 caactggccc gagacttcca acagtgctct ctggaaggaa aggtctgtgtgtgctgtccc 420 tctgttcccc tcctccggcc ctgtccagag tcggggcagg aactgaaagttgcccctaac 480 tccacctgtg atgaagcccg aggggccctc aagaacctgc tcttcagcgtctgtgggctc 540 accatttgtg ccgctataat ctgtacactc tctgctattg tctgctgcatccaaatcttc 600 tccctggacc tcgtgcatac gcagctggcc cctgagcggt cagtctcaggcccactggga 660 cctctgggct gcacgtcccc gcccccagcc cctctcctac acaccatgctggacctggag 720 gaatttgtcc cgcctgtgcc cccaccgccc tactatcccc cagagtatacctgcagctca 780 gaaacagatg cacagagcat cacgtacaat ggctccatgg acagcccagtgcccttgtac 840 cctaccgatt gccccccttc ttatgaggca gtcatgggac tacgaggagacagccaggcc 900 actctctttg accctcagct tcacgatggc tcgtgcatct gtgaacgagtggcctccatt 960 gtagacgtgt ccatggacag cgggtctctg gtgctgtcag ccattggtgacctccctggg 1020 ggctctagcc cgtcggagga ctcgtgcctg ctggagctgc agggctccgtgcgctccgtg 1080 gactacgttc tctttcgctc catccagcgc agccgtgccg gctactgcctcagcctggac 1140 tgtggcctgc ggggcccctt cgaggaaagc cccctgccac ggcgccccccacgggctgcc 1200 cgctcctatt cctgctctgc ccctgaagct ccacccccac tgggtgcccccacagctgcc 1260 cgcagctgcc accggttgga gggctggccg ccctgggtgg gaccctgcttccccgagctg 1320 aggcggcggg tcccccgggg agggggccgc ccagccgcag ccccgcccacccgagccccg 1380 actcgtcgct tcagcgatag ctcaggttcc ctcaccccac cggggcaccggcctcctcat 1440 ccggcatccc caccaccgct gctgctgcca cggtcccaca gcgacccaggcatcacgacc 1500 tccagtgaca ctgctgactt cagggacctt tataccaaag tgcttgaggaagaagctgct 1560 tctgtttcct ctgcagatac agggctctgc tctgaagcct gcctcttccgcctagcccgc 1620 tgcccttccc ccaagttgct acgtgcccgg tcagccgaga aacggcgccctgtgcccacc 1680 ttccaaaaag ttcccctgcc ctcgggccct gcacctgccc actccctgggggacctaaag 1740 ggcagctggc caggtcgggg cctggtcact cgtttcctcc agatatccaggaaagcccca 1800 gaccccagtg ggactggagc tcatggacat aagcaggtgc cccggagcctgtggggccgg 1860 cctggccgag agagcctcca ccttcgcagc tgcggagatc tgagctctagctcttccctg 1920 cggcgtctcc tgtctggccg caggctggag cgtggtaccc gcccccacagcctcagcctc 1980 aacgggggca gccgggagac tgggctctga 2010 7 669 PRT Homosapien 7 Met Met Pro Ser Pro Ser Asp Ser Ser Arg Ser Leu Thr Ser Arg Pro1 5 10 15 Ser Thr Arg Gly Leu Thr His Leu Arg Leu His Arg Pro Trp LeuGln 20 25 30 Ala Leu Leu Thr Leu Gly Leu Val Gln Val Leu Leu Gly Ile LeuVal 35 40 45 Val Thr Phe Ser Met Val Ala Ser Ser Val Thr Thr Thr Glu SerIle 50 55 60 Lys Arg Ser Cys Pro Ser Trp Ala Gly Phe Ser Leu Ala Phe SerGly 65 70 75 80 Val Val Gly Ile Val Ser Trp Lys Arg Pro Phe Thr Leu ValIle Ser 85 90 95 Phe Phe Ser Leu Leu Ser Val Leu Cys Val Met Leu Ser MetAla Gly 100 105 110 Ser Val Leu Ser Cys Lys Asn Ala Gln Leu Ala Arg AspPhe Gln Gln 115 120 125 Cys Ser Leu Glu Gly Lys Val Cys Val Cys Cys ProSer Val Pro Leu 130 135 140 Leu Arg Pro Cys Pro Glu Ser Gly Gln Glu LeuLys Val Ala Pro Asn 145 150 155 160 Ser Thr Cys Asp Glu Ala Arg Gly AlaLeu Lys Asn Leu Leu Phe Ser 165 170 175 Val Cys Gly Leu Thr Ile Cys AlaAla Ile Ile Cys Thr Leu Ser Ala 180 185 190 Ile Val Cys Cys Ile Gln IlePhe Ser Leu Asp Leu Val His Thr Gln 195 200 205 Leu Ala Pro Glu Arg SerVal Ser Gly Pro Leu Gly Pro Leu Gly Cys 210 215 220 Thr Ser Pro Pro ProAla Pro Leu Leu His Thr Met Leu Asp Leu Glu 225 230 235 240 Glu Phe ValPro Pro Val Pro Pro Pro Pro Tyr Tyr Pro Pro Glu Tyr 245 250 255 Thr CysSer Ser Glu Thr Asp Ala Gln Ser Ile Thr Tyr Asn Gly Ser 260 265 270 MetAsp Ser Pro Val Pro Leu Tyr Pro Thr Asp Cys Pro Pro Ser Tyr 275 280 285Glu Ala Val Met Gly Leu Arg Gly Asp Ser Gln Ala Thr Leu Phe Asp 290 295300 Pro Gln Leu His Asp Gly Ser Cys Ile Cys Glu Arg Val Ala Ser Ile 305310 315 320 Val Asp Val Ser Met Asp Ser Gly Ser Leu Val Leu Ser Ala IleGly 325 330 335 Asp Leu Pro Gly Gly Ser Ser Pro Ser Glu Asp Ser Cys LeuLeu Glu 340 345 350 Leu Gln Gly Ser Val Arg Ser Val Asp Tyr Val Leu PheArg Ser Ile 355 360 365 Gln Arg Ser Arg Ala Gly Tyr Cys Leu Ser Leu AspCys Gly Leu Arg 370 375 380 Gly Pro Phe Glu Glu Ser Pro Leu Pro Arg ArgPro Pro Arg Ala Ala 385 390 395 400 Arg Ser Tyr Ser Cys Ser Ala Pro GluAla Pro Pro Pro Leu Gly Ala 405 410 415 Pro Thr Ala Ala Arg Ser Cys HisArg Leu Glu Gly Trp Pro Pro Trp 420 425 430 Val Gly Pro Cys Phe Pro GluLeu Arg Arg Arg Val Pro Arg Gly Gly 435 440 445 Gly Arg Pro Ala Ala AlaPro Pro Thr Arg Ala Pro Thr Arg Arg Phe 450 455 460 Ser Asp Ser Ser GlySer Leu Thr Pro Pro Gly His Arg Pro Pro His 465 470 475 480 Pro Ala SerPro Pro Pro Leu Leu Leu Pro Arg Ser His Ser Asp Pro 485 490 495 Gly IleThr Thr Ser Ser Asp Thr Ala Asp Phe Arg Asp Leu Tyr Thr 500 505 510 LysVal Leu Glu Glu Glu Ala Ala Ser Val Ser Ser Ala Asp Thr Gly 515 520 525Leu Cys Ser Glu Ala Cys Leu Phe Arg Leu Ala Arg Cys Pro Ser Pro 530 535540 Lys Leu Leu Arg Ala Arg Ser Ala Glu Lys Arg Arg Pro Val Pro Thr 545550 555 560 Phe Gln Lys Val Pro Leu Pro Ser Gly Pro Ala Pro Ala His SerLeu 565 570 575 Gly Asp Leu Lys Gly Ser Trp Pro Gly Arg Gly Leu Val ThrArg Phe 580 585 590 Leu Gln Ile Ser Arg Lys Ala Pro Asp Pro Ser Gly ThrGly Ala His 595 600 605 Gly His Lys Gln Val Pro Arg Ser Leu Trp Gly ArgPro Gly Arg Glu 610 615 620 Ser Leu His Leu Arg Ser Cys Gly Asp Leu SerSer Ser Ser Ser Leu 625 630 635 640 Arg Arg Leu Leu Ser Gly Arg Arg LeuGlu Arg Gly Thr Arg Pro His 645 650 655 Ser Leu Ser Leu Asn Gly Gly SerArg Glu Thr Gly Leu 660 665 8 6439 DNA Homo sapien 8 ggtagttggttgtgggcact gggttagagg tatcacgtgg gggcactttc gtcttagctt 60 ttggacaagacgcaggcgca aacccacggc tgctgcgggg gatccttgtg gccctttccg 120 gtcggtggaaccaatccgtg caacagagaa gcggggcgaa ctgaggcgag tgaagtggac 180 tctgagggctaccgctaccg ccactgctgc ggcaggggcg tggagggcag agggccgcgg 240 aggccgcagttgcaaacatg gctcagagca gagacggcgg aaacccgttc gccgagccca 300 gcgagcttgacaaccccttt caggtgactt gcgccagtcg gcctcttttg ggcggtcagg 360 ttgattcttcccggttctgt agggtcgggc taacttgtat ccccatttgt gacatttgat 420 cctgggaagagccgccacgt ggggtgacag tgactccaga ccagtgagca ctctgggggg 480 cgggccctgccccttaatgg gctggtgctg gcagtgttgg atgatggatt tgaggtaaat 540 gttgtcccagtcctgggaca acggcgtggc cccgaaggtg cagtacttgg agccacaggc 600 agtttgggaatggtccctgg gaattcccct aggtggcact gggtgccagc tgagacccgg 660 gtctctgccctcaggaccca gctgtgatcc agcaccgacc cagccggcag tatgccacgc 720 gtgacgtctacaaccctttt gagacccggg aggtgagcta ctggagagca taggagttca 780 aggaagggaagggtttgcca gtgagacaag ttagcttggg tacaggggac ttttctgcat 840 gcacaggaaggaagaggcac tgtgatctta gttccctgag agaggctagt cagccatggg 900 atgcctcatccttctagggc caccagggcc accaacccaa atggtttgag ctctaaaagt 960 cagtcctgggaagatctgga ggctggcaag gccttaacca atgttctgta tgggaaggct 1020 ttcttgaagaaaataaaggt tggggctggt tggagtttag gttaggtgtg cagtttgaaa 1080 gcaaggaatgaggaggacta cacctgtttg tcccactgga ttagaagatg gtgagttgta 1140 agaatttaaggggttttcat ggtacttaaa acttaaaaaa aaaagccaag catggtgtgc 1200 atgcctgtaatcccacttgg gaggctgagg tgggaggatt gcttgagccc aggaatttga 1260 tggagaccatcctgggcaac agcgagaccc ccatctcttg agacgtttta aaaattggct 1320 aggtgtagtggcacatgcct gtagtccttg ctacttggga ggctgggatg gaaagactgc 1380 ttgggcccaggagtttgagg cttcgatgag tgatgattgc accactgcac tccagcctgg 1440 gtggcagagagacaacccca cactccatcc ccagaagaaa agagaaaaaa aaaacacagt 1500 taagtggggagatatggtaa atgagtgatt tgagtcctca ctaaggaatt ggatatgagg 1560 gatgataaggctgctacagg tcatcataga gttctctgac agaaattcaa cagaagaact 1620 agcttcctaacaaatggggg aaaggctgct tccttacatc atgagttctg ggtctaaagg 1680 tattcaagcagactatagat tcctgtgtta tgagggacta gatgatcact aaggtctttt 1740 cctcaacagccaccaccagc ctatgagcct ccagcccctg ccccattgcc tccaccctca 1800 gctccctccttgcagccctc gagaaagctc agccccacag aacctaagaa ctatggctca 1860 tacagcactcaggtacagga ggtgtgcagg tgagggctgg ggagaagggc cagtcctggg 1920 gcccagggctcacatctccg tctgctcaca ctgggcaggc ctcagctgca gcagccacag 1980 ctgagctgctgaagaaacag gaggagctca accggaaggc agaggagttg gaccgaaggg 2040 agcgagagctgcagcatgct gccctggggg gcacagctag taagtaatag agtggggaag 2100 agcatctgagagtttgggag gagcaaaaac aggtcttaag ggcctgggct cggctgggtg 2160 cggtggctcacacctataat cccagcactt tgggaggctg aggtgggcgg atcacgaggt 2220 cacgaggtcaagaaatcgag accatcctag ccaacatggt gaaaccctgt gtctactaaa 2280 aatacacacacacaaaaatt agctgggcat ggtggcgcgc acctgtagtc ccagctactc 2340 gggaggctgaggcagggaga atcgcttgaa cccgggaggt ggaggttgca gtgatccgaa 2400 atacgcgccactgcactcca gcctggtgac agaaatgaga ctccatctca aaaaaaaaaa 2460 aacacaaacctgggctctgg agtcagactg ctgagtttga attttagaga tactgcttac 2520 taaccatgagcttttaggga agttacttaa tcttgaatgc ctcagtttcc ctatttataa 2580 aatgaaaaccacgtttgcat ccgtttgaca gggttgttgt gaaggttaaa caaaatatat 2640 gtgaagtacttggtacagtg cttagatgtt ttaaataata ataaacagta ttaatttttc 2700 cactctcacttgtctccttg gaccccaaat tggagataaa agagaggatc cagggctggg 2760 tatggtgcctcacgcctgta attccaacac tttgggaggc cgaggcaggc aggttgcttg 2820 agttcaggagttagagacca acctggacaa catagtgaga tcctgcctcc aaaacattaa 2880 tgaaaaaaattagcagggca tgatgtacct gtctgtagtc ctagctcctc aagaagctga 2940 ggtggacggccaggcgccgg ggctcatgcc tgtaatccca gcactctggg aggccaaggc 3000 gggtggatcacctgaggtca ggagttcagg accagccaac atggtgaaac cctgtctcta 3060 ctaaaaatacaaaaattagc tgggcgtggt ggcatgtgcc cataatccca gctactcagg 3120 agactgaggcaggagaatta cttgaacccg ggaggcggag attgcactga gccaagatca 3180 caccactgcactccaacctg ggcaacaaga gcgaaactct atctcaggag aaaaaaaaaa 3240 aaaaaaaaaagctgaggtgg gagggttgct tgagcccagg aggttgaggc tacagtgaac 3300 catgatcataccactacctt ccagcctgaa caacagagac cctatctcaa aaaaaaaaaa 3360 aaaaaaaaaagagaggatcc agggatggag agaaggggga gtgattcctt tgttggtctc 3420 tgttttacttctggggtcca cctgtttttt ggcgccttac agctcgacag aacaattggc 3480 cccctctaccttctttttgt ccagttcagc cctgcttttt ccaggacatc tccatggaga 3540 tcccccaagaatttcagaag actgtatcca ccatgtacta cctctggatg tgtgagtagt 3600 gagaagccttttggaggaag ttacaggtag atctcttaac tgccctgggg tccgctcacc 3660 aagaaccaaacacttcacct ctatttagaa ctcaccagcc tgctagcaaa tgttcttgcc 3720 ctctccccactttttattgt ccatatgcag gaatatattt tgaattcttt agatgtcttt 3780 gggctgggtgcagtggtata cgcctgtaat cccagcactt tgggaagctg aggtgggtgg 3840 ataacctgaggtcagaagtt tgagactagc ctgatcaaca tggagaaacc ccatctctac 3900 taaaaatacaaaattacctg ggcgtggtgg cacatgcctg taatcctagc tactcaggag 3960 gctgaagcaggagaatcact tgaacccggg aagtggaggt tgcaatgagc caagatcatg 4020 ccattgcactccagcctggg caacaagagc aaaactccat ctcaaaaaaa aaaaaaaaaa 4080 aaaaaaaaaaaaaaaaaagg ccgggcacgg tggctcacac ctgtaatccc agcactttgg 4140 gaggcccaggtgggcagaac atgaggttag gagatcaaca ccatcctggc taacacggtg 4200 aaaccccgtctctactacaa atacaaaaaa ttagccgggc gtggtggcgg gtgcctgtag 4260 tcccagctacttaggaggct gaggtaggag aatgggcgtg aacccaggag gcagagcttg 4320 cagtgagccgagatcctgcc actgcactcc agcctgggcg acagagcgag attccatctc 4380 aaaaaaaaaaaaagaaaaaa aaaaattagc tgggcgtggt ggcgggcgcc tgtagtccca 4440 gctactctggaggctgaggc aggagaatgg cgtgaaccgg ggaggcggag cttgcagtaa 4500 gctgagattgcgccactgca ctccagcctg cgcgacagag ccagactccg tctcaaaaaa 4560 aaaaatgtctttgcaagggg atcacacatt attgacattt gctttctcca tctcccctgt 4620 tgaggattccatgaaggcag cagctgcctt catttcctta ctctgccatg tttggtgaat 4680 attataggatgagcatagat gggaaggagc cttcattgca gtccagaagg gctcctcatc 4740 ctgtccctctgccccttagg cagcacgctg gctcttctcc tgaacttcct cgcctgcctg 4800 gccagcttctgtgtggaaac caacaatggc gcaggctttg ggctttctat cctctgggtc 4860 ctccttttcactccctgctc ctttgtctgc tggtaccgcc ccatgtataa ggctttccgg 4920 taagtgtgttagtggtggga gagtgatgga gacctgggat gggccccacg tctgcccatc 4980 cttcagctctaattcttctc ccaccctccc catttttttc ctctttgtag gagtgacagt 5040 tcattcaatttcttcgcttt cttcttcaat ttcttcgacc aggatgtgct ctttgtcctc 5100 caggccattggtatcccagg ttggggattc aggtttgtga ggctgttatc caccctcacc 5160 tttccctctagatccagcca gcactgggtg ctggatagga gttgttcaga aaaaggaaat 5220 gtggttttaatccttgggag gtactagttt aatgagatac aagacatact tccaggatag 5280 agcgcagacagcactcttga cacatataga ttgaagggaa gaatgctgca tttggccaat 5340 ctaaggtggcttcctggagg aggcatagaa ccatggttga aaggaggaag aagaatcccc 5400 agatgagtgcctggaagagc ttggagagct cagggctaat ggttcagaaa ctggaattaa 5460 actatgaagagattagatac agtttgggat tgtggtgctt ggaatgctgc tcgattgaca 5520 gggagccactgctgatggga agggtaggaa caggggatgc tggtgacata accagtggag 5580 aagctgaggagcccctcttc actggtacat ccttcccttt acagtggctg gatctctgct 5640 ctggtggtgccgaagggcaa cacagcagta tccgtgctca tgctgctggt cgccctgctc 5700 ttcactggcattgctgtgct aggaattgtc atgctgaaac gggtgagggc tgtgtcgaag 5760 gtggggccgggatggtgaga tcatgggtcc ccaggggcgt gggtggaaca ttcaggagca 5820 actggcacaggtcaggctgc tgggttgttc tcagctaatg gacctctggg gtgtgtgttt 5880 ctgtgtgtgagtgtgtgtgc tgggcagcag gctgctgagt ggtagtgatg ctgttaggct 5940 gggggtggggaaccagtggc tggaatgggc ggtaatgtct ttgtcctcta cttgcagatc 6000 cactccttataccgccgcac aggtgccagc tttcagaagg cccagcaaga atttgctgct 6060 ggtgtcttctccaaccctgc ggtgcgaacc cgagctgcca atgcagccgc tggggctgct 6120 gaaaatgccttccgggcccc gtgacccctg actgggatgc cctggccctg ctacttgagg 6180 gagctgacttagctcccgtc cctaaggtct ctgggacttg gagagacatc actaactgat 6240 ggctcctccgtagtgctccc aatcctatgg ccatgactgc tgaacctgac aggcgtgtgg 6300 ggagttcactgtgacctagt ccccccatca ggccacactg ctgccacctc tcacacgccc 6360 caacccagcttccctctgct gtgccacggc tgttgcttcg gttatttaaa taaaaagaaa 6420 gaggaactggaactgacat 6439 9 8253 DNA Homo sapien 9 cctgcggcag ccggagctcg gggagcggagcgtggtgggg aggggagcgg gacaggcgac 60 acaggagaca gcggcgccgc ggcctctccccaccaggcgg ccccggatcc tactggacgc 120 cctgagggca caccgaccgc gcctctagagtcaccccacg ccgacccctc ccctcttctc 180 tagacttatt tccatccttc ccgcttttaccctccccacc cgtccctggg ctccaggccg 240 ccgccccctc ctcactcctg gaccggcccttctcggtgcc cctcttccct agggagatgc 300 gatgagccgg tgcccccgcg tcctcatcgtcgccccgggc acggtgcccg tccagtgccc 360 gtggtgggga gggagcactc cgcggtccctccgtgacgcc cctcgcttgg ccccccccac 420 agctggcgtc cctcggccat gccccaggggacccagccag ggggtgggct ctagagcgag 480 tggggtggag aggagaaagg acggggccttgggggcctct gagatgctcc caagtgccag 540 ggagggccga gcgaggcgca ggcaaccgggcagcaggcat gatgccctcg cctagtgact 600 ccagccgctc gctgaccagc cggcccagcaccaggggcct tacccacctc cgcctccacc 660 gaccctggct gcaggccctg cttacgctggggctggtcca agtgctcctg ggcatcctgg 720 tggtcacctt cagcatggtg gcctcttccgtcaccaccac cgagagcatc aagaggtcct 780 gcccgtcttg ggctgggttc tcggtgagttgggtgcacag ttgttgggtg ggggaggctc 840 ctcgggcccc accctccaca ccagctgcaagtccacatgc ttgcctctcc tccctctcct 900 ggtcctctgc ctccttacag ggctgggtgcatcctgtggt gaggggctca ctcaggagcc 960 tccccctgcc aggctgagcc tgtgcccactctgtccccag ctggcgttct ccggggtggt 1020 tggcattgtg tcctggaagc ggccattcactctagtggta ggtgccaggg tccagtgccc 1080 actgggaggc aggtgcccag cacgcaaggggaagcccatt tatgacctca agaagggaac 1140 tggctcccca gttggtgcca gccgggtgggcacgaagttc tgtgaaggag ggggactctg 1200 tccgtggcag aggagtactc atgagggtcccagccctgag tctgcaccct gttttcccca 1260 gatctccttc ttctccttgc tttcggtgctctgtgtcatg cttagcatgg ctggctctgt 1320 tctctcctgt aagaatgctc aactggcccgagacttccaa cagtgctctc tggtgagatt 1380 tgaggaggga gagctggaaa gaactggctgggggaggtgt gcaggacacc tcagtttgtg 1440 ctgactcagg ctgcctcacc ctccctgctccactcaggaa ggaaaggtct gtgtgtgctg 1500 tccctctgtt cccctcctcc ggccctgtccagagtcgggg caggaactga aagttgcccc 1560 taactccacc tgtgatgaag cccgaggggccctcaaggtg agcttgcacc ctgcaaacat 1620 cctcctggtt ctcactcttg cctcccctgctggggatctc acaggcccat aatgtgtgga 1680 acttagccgt ctcctgactc ctctcctcctttcccttccc cagaacctgc tcttcagcgt 1740 ctgtgggctc accatttgtg ccgctataatctgtacactc tctgctattg tctgctgcat 1800 ccaaatcttc tccctggacc tcgtgcatacggtgagaagg gagcaggggc cagggcacgc 1860 aggtatggtg ggggcagggg tgtgtgtgctgagacttgcc tgagggaaca atggctacag 1920 atctggcttt tgagcaccag gggctatgggttatctatta tcctcatcta ttggaggaat 1980 ggatgtttag tggggaggat gagaaggggagatggcaggg agtgagttaa gtatgtgatg 2040 ctggtctggg accaggagag ggtgctttagagaatgggac tggatggtgg ggtagagtca 2100 agaaggtcct atgctgggca cggtggctcattcctgtaat cccagcactt tgagaggccc 2160 aggcaggcgg atcacctgag gccaggagttcaagacccac ctggccaaca tggcgaaacc 2220 ctgtctctat taaaaataca aaaatcgggccgggcgcagt ggctcgtgcc tataatccca 2280 gcactttggg aggctgagga gggcagatcacctgagatca ggagatcgag actatcctgg 2340 ctaacatggt gaaaccccat ctctactaaacatacaaaaa attagccagg cgtggtggcg 2400 ggcgcctgtg gtcccagcta ctcgggaggctgaggcagga gaatggcatg aacctgggag 2460 gcagagcttg cagtgagctg agatagtgccactgcactcc agcctgggcg acagagcaag 2520 actccatctc aaaaaacaaa caaacgaaatacaaaaatta gccaagtgtg gtggcgggtg 2580 cctgtaatcc cagctacttg ggagtctgaggcaggaaaat tgcttgaacc ggggagtcag 2640 aggctgcagt gagctgagat ggtgccactgcattccagcc tgggcgacaa gagcacagac 2700 tccgtctcaa aaaaaaaaaa aaaaaaaaaaaaaaaatata tatatatata tatatatata 2760 tatatataca cacaaaatta caaaaattaggctgggctcc gtggctcatg cctgtaatcc 2820 cagcactttg ggaggccaag gcaggaggatcaccagatat caggagtttg acaccagcct 2880 ggccaacatg gcgaaaccct atctctactaaaaatacaaa aattatccgg gtgtggtggc 2940 gggtgcctgt aatcccagct actcggaagactgaggctgg agaatcgctt gaacctggga 3000 ggcagaggtt gcagtgagct gagatgtagccattgtactc cagcctgggc gacaagagtg 3060 aaacttcgtc tcgaaataat aataataataataattagct gggcatggtt gcacaccctt 3120 ataattcctg ctactcagga ggatgaggcatgggaattgc ttgatcttgg gaggtggggg 3180 ttgcagtgag ctgagatcgc gccactgcactccagcaaca gagtcagact ctgtctcaaa 3240 aaaaaaaaga aggtcctaga tggaggtgaggctaaagggt ggacattcct gtgaagacac 3300 agaatgtgtg gagcttctgg atggaggtggggccataaag aggaatttgt gggtgggcag 3360 ggccagaggc tagaggtaca gggtgacggtggggttaagg agagctgatt ttgggtgagg 3420 gaggggccag aataagagct tctctcaggggatagggcca gaaaaaataa tgtagatgag 3480 agaggagttt ggggcccagg aggagctgtattcccagaat ccagacttgc tgacctgctc 3540 gcttccccag cagctggccc ctgagcggtcagtctcaggc ccactgggac ctctgggctg 3600 cacgtccccg cccccagccc ctctcctacacaccatgctg gacctggagg aatttgtccc 3660 gcctgtgccc ccaccgccct actatcccccagagtatacc tgcagctcag aaacagatgc 3720 acagaggtaa ggcctggtgg ggtcttgctgggggagctaa ggaaggggac tggggctggg 3780 cctggattgc tggagagagg gatctttgtggagggggaat tatttctcct acattgaccc 3840 tcttcctcat ctgccagcat cacgtacaatggctccatgg acagcccagt gcccttgtac 3900 cctaccgatt gccccccttc ttatgaggcagtcatgggac tacgaggaga cagccaggtg 3960 agagcacagc acggcttggg gcgggctggggagccgggtt gtagcctgga aagctgaaca 4020 ggctgtagcc tctggataaa gatagtaacagtggtcaagg tctttacagc accagcatgc 4080 ccactgttgc ctttgatctt ccctagagcctggtgaggtg agtgggttgg aagctatcat 4140 ttctgccata cagacgtgga agctgaggctgcacaattaa gtgacttgta caaagggaca 4200 aggctggtct ggaatctaag tctccagagctctgtctagc acatgggggt gttcagtgtg 4260 tagggctgaa cccttgaccc tgtgtcttctgcaggccact ctctttgacc ctcagcttca 4320 cgatggctcg tgcatctgtg aacgagtggcctccattgta gacggtgagc agggcgtaat 4380 gaggggtgga caagggcggg gctgccagggatagctgggg tgggtagaga caatagaagg 4440 ggaaaacaag gcggagttgg tggtttggggacataggagg gctgtggcaa cttggaaccc 4500 tggacttatt tttctcctct gagataaagctggaggcaca gtggccccta gaggctgggc 4560 ttgggagaag aggaactgcc tgggcagggctaggccaggc caggctggtg ctacacagcg 4620 ccccctgccg cccacagtgt ccatggacagcgggtctctg gtgctgtcag ccattggtga 4680 cctccctggg ggctctagcc cgtcggaggactcgtgcctg ctggagctgc agggctccgt 4740 gcgctccgtg gactacgttc tctttcgctccatccagcgc agccgtgccg gctactgcct 4800 cagcctggac tgtggcctgc ggggccccttcgaggaaagc cccctgccac ggcgcccccc 4860 acgggctgcc cgctcctatt cctgctctgcccctgaagct ccacccccac tgggtgcccc 4920 cacagctgcc cgcagctgcc accggttggagggctggccg ccctgggtgg gaccctgctt 4980 ccccgagctg aggcggcggg tcccccggggagggggccgc ccagccgcag ccccgcccac 5040 ccgagccccg actcgtcgct tcagcgatagctcaggttcc ctcaccccac cggggcaccg 5100 gcctcctcat ccggcatccc caccaccgctgctgctgcca cggtcccaca gcgacccagg 5160 catcacgacc tccagtgaca ctggtgagccccctccccga ctgcccaggc tcaggagagg 5220 gtaggcactg ggagttaggt ggccagtgatgcccaccagg attggggcac agttgaggtg 5280 ggtgaggagg aagaaagggt gagttcactcctctggtaga cattctcagg tactcacctc 5340 ctagggacag aacatccata gcttagcagctaaaaaagga gaattttttt ttttttttga 5400 gacggagtct ctggctctgt cgcccaggctggagtgcagt ggcatgatct tggctcactg 5460 caacccgttt cccgggttca agtgactctctcctgcttca gcctgccgag tagctgggac 5520 tacaggtgtg cgccaccatg accggctaatattttttttt tttttttttt ttgagacggt 5580 gtctcgctct gtcacccagg ctggagtgcagccgcgcgat ctgggctcac tgcaactccg 5640 cctcccgggt tcacgccatt ctcccccgggttcacgccat tctccttcct cagcctcccg 5700 agtagtgagt agctgggact acagatgcctgccaccacgc ccggctattt tttgtatttt 5760 tagtagagat ggggtttcac cgtgttatccaggatggtct caatctcctg acctcgtgat 5820 ctgcccgcct cggcctccca aagtgctgggattacaggca tgagtcaccg tgcccgggca 5880 atttttgtat tttttagtag agacaggtttcacccttttg gccaggctgg tcttgaactc 5940 ctgacctcaa gtgatccacc cgcctcggcctcccaaaagg atttattttt tgaaaccagt 6000 tccacagctc tcagcttggt ccacttatctgtcctcccca agcttcagct gtcacttgtt 6060 aacatgtata ataatagtac ttcacgccgggcacggtggc ttgcacctgt aatcccagca 6120 ctgtgggaag ctgaggtggg tggatcacctaggtcgggag ttcgagacca gcctggctaa 6180 catggtgaaa ccctatctct actaaaaatacaaaaattag ccaggtgtgg tggagcgcgc 6240 ctgtaattcc agctactaac gagagaggctaaggcaggag aatcgcttga acctggaaag 6300 caggggttgc cgtgagccaa gatcatgccactgcactcca gcctgggtga cagagacaca 6360 ctccatctca aaaacacaaa caaacaaacaaaaaacatgt ataataacag tacttcagcc 6420 ataggcattg tactcgaaga tgctgagaaagaacagtggc gggcaaggct gttcacacct 6480 ataatcccag cactttggag gccaaggcaggtggatcacc tgaggtcagg agttcaagac 6540 cagcctagcc aacatggtga aacccccatctctactaaaa attgaaaaat tagctgggcc 6600 tggtggtgga cgcctgtaat cccagctactagggaggctg aggcaggaga atcgcttgaa 6660 cctgggaggc ggaggttgca gtgagctgagatcgtaccac tgcactccgg cctgggcaac 6720 acagtgagac tccatctcaa aaaaataagaaaggagatag tactggggaa cgctcagcac 6780 tgtgcgccag gtgctgaaca acaccactgcagtccttgtt gtggtggatt gtaccatcta 6840 gttgctggct aatatggaca gagatgctggccctttgatt ggggatggag cgtgggagct 6900 gtgaaagctc ctctgggctt gagttcccacaggagggtgg gcgtgtccac agaacacttc 6960 cactcactcc ctgtctccct ttctctcttctccccagctg acttcaggga cctttatacc 7020 aaagtgcttg aggaagaagc tgcttctgtttcctctgcag atacaggtca ggcatgtggt 7080 ttgcgcccca gggatgggga ttgggcatggctgcccagcc ccctctccac cctacaatac 7140 cattctctta tctctgtctc tctgcagggctctgctctga agcctgcctc ttccgcctag 7200 cccgctgccc ttcccccaag ttgctacgtgcccggtcagc cgagaaacgg cgccctgtgc 7260 ccaccttcca aaaagttccc ctgccctcgggccctgcacc tgcccactcc ctgggggacc 7320 taaagggcag ctggccaggt cggggcctggtcactcgttt cctccagata tccaggaaag 7380 ccccagaccc cagtgggact ggagctcatggacataagca ggtaggaatt cggggagcca 7440 ggaaagatgt ttgggaaagc gtggagcttcagattgagcc ttattgatga tgccctttct 7500 tgtgtccctg tccaggtgcc ccggagcctgtggggccggc ctggccgaga gagcctccac 7560 cttcgcagct gcggagatct gagctctagctcttccctgc ggcgtctcct gtctggccgc 7620 aggctggagc gtggtacccg cccccacagcctcagcctca acgggggcag ccgggagact 7680 gggctctgac ctaggcttct tgtcacactgaacacatcca gccacaggca ccagctggtt 7740 gggaccagca gcccccagca tcctcttgcactggctggca caaaaagaaa cctgctgtat 7800 accccccaaa gtgtcccttt ccctcctacctctggggtct cttgctgctt gcctctgctg 7860 ctctggtctg ggagagcttc tgtcctgtgctgcatgggta tttagactgt gggggagatg 7920 ccccttctta tagcactgga ggaggaaaacaaattcttgt ccccctcaga atgagagtgg 7980 ctctttctga tttgcaaggg cactatggtcagggcaaagg catggcccag gtgtttaagt 8040 acagggtgac gtgtgcctat gcaatggggtggtaaggcag gcacgaagag tccaaaaaat 8100 ctaggtggcc tctcagctct gccacctctagctgcatgac cttgggcaag ctatgtaacc 8160 ccaattgcct gctccattaa agactgtgaaggtagaatgt ttgtaaagct cttaacagta 8220 tgtaagcctt caataaattt cagttttcccctt 8253 10 10482 DNA Homo sapien 10 tcccagggtc ccgggttggg ggggtggagcagcatttcgt cgccgcgggg gtgccgggac 60 tccggccgca gtgtcgccgc catcacggacttcctgtggg acaagcgcac gggcctcgcc 120 gccagaacgg tgagcgcgcg ggttggctggccgcgcgaaa agatggcgac cgcggggcgg 180 gcggggttag gcttgggttg ggggcaagggacgggggcga gatttggaga ggaggagagt 240 tgagacctac gaagcgacgg agtagggagatgagggggaa ggaggtcggc ctgcgttaga 300 tgtaccaaga agacctgtag gaagcttgggcggaaccaga ctagaaggga tgatagagta 360 aggatgggag tggttccaga gatgtgaaggagtggaatat caaagttgga ctcgggtcca 420 atcaaagctg gaggggagga atacatcaccgtgtcacttg gaacatgcag gaagaatttg 480 cctgtaggag ttctggattt tctgagtaagaaaggaaaaa gcaaaagatt tcaaacttga 540 gaaaggtagg ctagtgatgt tttgaggggacagtgctgtg aggccgtttg atggtagtat 600 gtggtgagat ggcatttatg tggctatgctgattaggata actatgctgg gaaaatcttg 660 tgcttgcagc ttggtatgct accggccccagttacaccaa gaatgcatca aataccaccc 720 tcctcaattt ggggaggttg ggaagaagtgtatacagaat ggtggtcaga catcggaatc 780 attaaacctt ttgcatacgc gattctcatacacccccacc aagacttatg tccctgggcc 840 tgtttcattt tattgtgcct tcctattctcatcttcctgg attgtgacag tgatttttct 900 cttttttcct tgaaggaaaa aaaggatttgtgaaccatcc cattccaatt ccattaactg 960 gaaggtggcc agctgtcagc taagtagggctgatattaat taaaaaccac ttactggcca 1020 ggcgcggtgg ctcacatctg taatcccagcactttgggag gccaaggcgg gcggatcacc 1080 tgaggttggg agttcgagac cagcctgaccaacatggaga aaccctgtct ctactaaaaa 1140 tacaaaatta gctgggcgtg gcggcgcatgcctgtaatcc cagctactcg ggaggctgag 1200 gcaggagaat cgcttgaacc ctgcaggcggaggttatggt gagccgagat tgcgccattg 1260 cactccagcc tgggcaacaa gagcaaaactccgttttcaa aaaaaaaaaa aaaaacacaa 1320 gaaacagttg ttatctacgt cttaagcacaagagtggagg ggccagatgg gatgctgact 1380 gtattttgtg gcctactaca ttaaaacctctcaagggaga gttggccaaa gatgtgagag 1440 atgattaagt taattcaagg ggcccgacccagatccctgg agctgatagt atttggcgtt 1500 tacctggtat atgtagaaac ctgggagactggattcctac cctcaggaca ctgatagtct 1560 acttgaggag atgagatgta ctcttaatgagtgtacaaca tttatcataa ctgtaaacca 1620 aagctttact ctattgactt gtgaggaactcagcatcaca cagatctata tcttagatcc 1680 tgaacgggga tacactgtgg cagtctccatcggtcgtcat acactttgct ctgtagtgaa 1740 ctgtgtccta gacctgaagg tcttcatggagctgtgtttg aagaaaataa ggatagaaca 1800 cttgaactgg ctgggtgcgg tggctcacgcctgtaatccc agcactttgg gaggccgagg 1860 caggaggatc acgagtcagg agtttgagaccatcctggcc aacatggtga aaccccgtct 1920 ctactaaaaa tataaaaatt agccaggagtggtggtgcgt gcctgtaatc ccagctacac 1980 aggaggctga ggcaggagag ttgcttgaacctgggaagtg gaggttgcag tgagccgaga 2040 ttgtgccact gcactccagc ctggccacacagcgagactc tgtctcaaga ataaaataag 2100 aacacttgaa ctgaggggtg ggtaatgagggatgggtttg ggggctttct agagtttcag 2160 ccagagttag taggtggctg tactccaataaggaggaatt gtaagtacag aggaagactg 2220 aggaagttgt gcctggtaaa aagaggcagaaataagaggg gaaagcagct tagcatcttt 2280 ggagcagcca gtccttggag gagatttctgaaaactcgaa actttcctat ttccgctttc 2340 ccttgccttc cagatgccgc atcctcgaaggtaccactcc tcagagcgag gcagccgggg 2400 gagttaccgt gaacactatc ggagccgaaagcataagcga cgaagaagtc gctcctggtc 2460 aagtagtagt gaccggacac gacggcgtcggcgagaggac agctaccatg tccgttctcg 2520 aaggtgaggc ctcaggaaga gatactcaaacacagggtca ctcgctcatt cctttgttca 2580 acaaatactt attggttata tgctaaaaccattgccttaa gtagctggct gaaggtaatg 2640 tcgggataga taaccacaga agccatttgggctgcagtgt taagtgttag aatacataat 2700 gaggaactga ggacgtaccc agtgcgtttggttggagtgg ggcaggatgt taattcagat 2760 aaaacattgc agaaagcaac atctgtatatgtctggagga caagtagagt caggtggggg 2820 aagtgtcttt gggcagaggg aaagcatgtgcaaaggactg ggagctacag gtaactaata 2880 agctaacaga taagacaaag ttagatctttgtgtttttaa ttttttaaaa attttaaaat 2940 atcatttatt tatttttgag acagagtttcactctgttgc ccaggctgga gtgcagtggc 3000 aagatctcag ctcactgcaa actctatggccccaggttca agttgattct tgtgcctcag 3060 cctcccaagt tacctgggac tacaggcatgcgccatcatg cctggctaat tttttttttt 3120 tttttttgta tttttagtag agacaggatttcaccatgtt ggcctcaaac tcctgacctc 3180 aagtgctcca cctgccttgg cctcccaaagtgctgggatt acaggcgtga gccactgtgc 3240 ccggcccagt tttaatattt tagtggtatgttcagtggaa agaatgggtt aaggatggtt 3300 tagagcaagg ggggatttca ttcaggaaagcttgagaaaa aaatggctct ttgtttcttt 3360 gtatcactgc agcgggaaaa aggccccaatagggagtggt tctgcagatg ccttttcaga 3420 ggttttggtg acatggcaga tgtgcttggttaacctcaac ttcttctggt cccttggtcc 3480 tctccagtga ctcaaacaag tgtctgagaacagtttttct tagcattcac tctttgtcct 3540 cactaaggca ggaccttgtc acagggtatggtagggaatg tgatctgaac agatacacct 3600 cattggtatt tgttatccaa cagcagttatgatgatcgtt cgtccgaccg gagggtgtat 3660 gaccggcgat actgtggcag ctacagacgcaacgattata gccgggatcg gggagatgcc 3720 tactatgaca cagactatcg gcattcctatgaatatcagc gggagaacag cagttaccgc 3780 agccagcgca gcagccggag gaagcacagacggcggagga ggcgcagccg gacatttagc 3840 cgctcatctt cggtgagtgc cagcccaggcccttcctctc cccactcttc tgcaggccct 3900 ctaggactct ggtaagtgag cagtatccttgttctcagct gaacattggg gcatgaacac 3960 tgaggtgggc actgagtttg cctactttcttggaagctct ccgactcttg aagggccctg 4020 gatctgcttt ggagatggat gggcacggagcatttgtgac ccccagtgct ctccctggca 4080 tgttgggctt attgtgttgg gagcagcttctccgccccag cggcctccac tctttaatgg 4140 ggaccttgct tgttgaactg cctttttcccccaagccctg ggctctgtag cccccttggc 4200 aggcgggctt gggtgggggc aaggggagatctgtgtctgc ccggaagggc attgtgtaga 4260 gcatggggtt gtgggtgaca ttggcaacaacaccacttct ctgctctgcc catgcctctc 4320 cctgttcctg ttttgggttt ggggacttgggattatgcgc cgctctctct tctcacaccg 4380 atccacctta ctgcatctga cgtgttcccttccactgtcc cccatcatcg tctgtccccc 4440 ctggctgggc gcctgtgacc ggtgacccctctaccaccca ccccgcctcc ctccggcccc 4500 cggctccgaa cactgggctt ggttcggaaccccccctgcc ccccgacagc agcacagcag 4560 ccggagagcc aagagtgtag aggacgacgctgagggccac ctcatctacc acgtcgggga 4620 ctggctacaa gagcgatgta caagccaaatcgtaacaatc ctatagcctg taatggtccc 4680 atagccatcc taacgtccca agccaacctgagtcacagcc tcttgccttt tgctcagtgg 4740 tgttgtttat ctgggtggtt tgagttgctgttgagaattg acctctgttg tccactccca 4800 gctctcacgg cccctgggtt aaaggtggtgggaatcacgc aggggttttc ttcccctgtg 4860 accatctgta tctgttcccc ttccttcatctccaccccag gttgctgtcc ccttttttct 4920 tccaactcag ctcattcccc accttctctccctccctctc cccgaccctg ctctctttca 4980 tttcagatga aatcgttagc accttaggagaggggacctt cggccgagtt gtacaatgtg 5040 ttgaccatcg caggtaactg tcagtccctccctactatgt ggggctaaag agatggttgg 5100 ggttatatgg ggcttttttg ctaattaacctgaggtagaa tttcttagtc cccctacagc 5160 cctgttcatt ttgagacatt cttgagaacccagcaaaagc ctctcctgcc aacttacagg 5220 ggtggggctc gagttgccct gaagatcattaagaatgtgg agaagtacaa ggaagcagct 5280 cgacttgaga tcaacgtgct agagaaaatcaatgagaaag accctgacaa caagaagtaa 5340 gcaagcaagg aagtgtgtag ggaggctgagagccccaacc cctacacggg agagattcct 5400 agacctggtt taggcagaca gggggagtcctagaccttct catccattca tcttttgttc 5460 atcatcttag agtagtagaa catcagggtaggaaaggggg ggggcccatg acattcctta 5520 atccactccc cttgttttca gggaagttagattgctctgt cgtttgaccc cagcctagaa 5580 tggcttctgt agagatatac cctggacatagccctaggac tgagcatcaa cctcatcgaa 5640 caatgaatta agcttttata agtagaactatgctgataag gccacagaca tttatgcagt 5700 aatattttgt tcacatacat tcacagtccaaaaatagaat aaggacttta agccaagata 5760 tgaggccagt aggtataaat aggagcccagtggtagctag tggtgaatga atatcaggag 5820 ttgggctggg gtttgggatt taggatggacagtttgatga ttccaggatc tatactgttt 5880 ggagcttggc actccacagc tcttccagcaaatagttttt gaactattta aaatgacgca 5940 ggaagatatt ttgaaacttg ggctgggcatggtggctcac gcctgtaatc ccagcacttt 6000 ggaggccgag gcgggtggat cacaatgtcaggagttcgag accagcctgg ccaatatggt 6060 gaaaccccgt ctgtactaaa aataaaaaaattagccgggc atggtggtgg acaactgtag 6120 tcccagctac ttggaaggct gaggcaggagaatcgcttga acaggaggca gaggttgcag 6180 tgagccaaga tcgcgccact gcactccagcctgggtgaca gagtgagacg ccatctcaaa 6240 aacaaaaaag aaaaaaacat ggcagatagtaccatttctt tcttgggtct ggtagaggct 6300 actccttagc tgaactgaat tttggtgttagtactagctg gcatggtttt acacaagtta 6360 tgtggaatca acagctatga agtacctccttgttggatgg ctgatgggca gatgggagct 6420 catcagacaa tcccccttcc cccatctctcttctcagcct ctgtgtccag atgtttgact 6480 ggtttgacta ccatggccac atgtgtatctcctttgagct tctgggcctt agcaccttcg 6540 atttcctcaa agacaacaac tacctgccctaccccatcca ccaagtgcgc cacatggcct 6600 tccagctgtg ccaggctgtc aagtgtgagtggggtgggcc gaagtggact ctggggcagt 6660 ccctcccttc attggatctc ttctgtcggttgtgcactgg tgaagcccct aaacagtcag 6720 ctgtctgtta tctgcagttc tttgatttactgtcatcttg aaacgtcttc tgacttaact 6780 ccttgactga tgtctttatc gtcactgattgctcttactc tacacctagc ctagcagcag 6840 ctagaagaga aagcctttgg aatcaaagcacttaattacc ctcccctttt cctttctccc 6900 tttcttggga aagcatcagt cagacagcaaacataaagag acaaaaatac actccttagg 6960 gtaaaggctt acatttgtct gggatgagatgttcattcac agcaaaggag atgggaacac 7020 agagtatgta gttcagctaa ggggcaaggtggggaattca aagaaatata tcattcctct 7080 ggagtcatca aaataataca gtttcacagaattgagttaa cataatgcca gctagacaca 7140 tgtcaaagtg tgtacacact acaactcaaagcaaactttt ttttttttta catcagttga 7200 gctacatatg tatcttacat tagaaagagcagagcttctt agaccaggca ttccatttag 7260 acgtagagcg gaaagcagcc ctagtgattctggacctgtc tcctcactgg ctttgcccta 7320 ggtaaccagg cctggggtca gctgataccagttagctctg gccacctgca ccaagcctga 7380 cctggccttc tcccctacag tcctccatgataacaagctg acacatacag acctcaagcc 7440 tgaaaatatt ctgtttgtga attcagactatgagctcacc tacaacctag agaaggtaag 7500 atggataggg tctgcccttg gttactgggggcaggcagct gcaccacttt gccttctgcc 7560 gagccctttg ttttctccct tttatttcgtcctcccacat tttctccctg actggctcca 7620 actgggtaaa actaaatagg ttgaaagggagaaatctctt aagaagactt aaattgggag 7680 ataacttgta caggggactt cagataactttccagtagag tgaaagtttt taaggttctt 7740 ggtttgggct ttatttattt atttatttatttatttattt atttatttga gacagagtct 7800 cactctgttg cccagactga tgtgcagtggcataatctcg gctcactgca acctctacct 7860 cccaggttta agtgattctc ctgcctcagcctctggagta gctggggtta caggcacccg 7920 ccaccacgcc tggctagttt ttgtgtttttagtagagatg gggttttgcc atgttggcca 7980 ggctggtctc gaacccctaa cctcaggtgatctgcctgtc tcgggctccc agaatgctgg 8040 gattacgggt gtgagccaca gtgcctggcccggacttctt ttttgagatg tgtatttctg 8100 tgaggtagga aagctcggct cctttgactaactggagata atgaagatct cagacctaaa 8160 gaagctctcc ctcctttgac ccctttgctacatgttacat atttttagag aaactccttt 8220 gctacatgtt acattttttc agaaaaacccatttgctaca tgttacattt tttcagaaaa 8280 acccatttgc tacatgttac atgttttcagaataactata tatctggcac ttgagtgtag 8340 tcttcagatg cttacagatg tgcacgctgttctataatca tttcttaatc cttttgctcc 8400 cctacttcta aaagtattat gctggatgtggtggaagctg taaaacagga atttcccagg 8460 cttgtaaggc caggcaggag atcacagtaccttttttaag gctcagaagc aatgtagaga 8520 atactggtgg aatctcagtc tgggtgcagaggttcatcct ctttctcctc tgctcctaga 8580 agcgagatga gcgcagtgtg aagagcacagctgtgcgggt ggtagacttt ggcagtgcca 8640 cctttgacca tgagcaccat agcaccattgtctccactcg ccattaccga gcaccagaag 8700 tcatccttgg taagggaggg caaggctgtccaagtgtgtg agatgatgtg agggtggggc 8760 cacctaagcc tcataacacc ttttccctcccattctcacc cagagttggg ctggtcacag 8820 ccttgtgatg tgtggagtat aggctgcatcatctttgaat actatgtggg attcaccctc 8880 ttccaggtaa gtgatgggat gtcttacttgactgcctggc attcttctac ctggttcctt 8940 tgttttctgc tgaggacctg cctactcagttctccacatt gcctgccttc ctggcagctg 9000 atccttagca tgcccttttc agtcccatgctcagtttctg tttttgtttc ccagacccat 9060 gacaacagag agcatctagc catgatggaaaggatcttgg gtcctatccc ttcccggatg 9120 atccgaaaga caaggtgaac cttgagggggcactagttaa ctcttttcct tttctctcca 9180 cagaattggt ctatttcaca tcattttcttttttctttga tacctcctct ccccccagtt 9240 actttcagat ggggaaataa gggaattgtaacaagggtga ccttctgatt cctcaacctc 9300 cccttcccct ctagaaagca gaaatatttttaccggggtc gcctggattg ggatgagaac 9360 acatcagctg ggcgctatgt tcgtgagaactgcaaaccgc tgcgggtgag ctgggctcgg 9420 gataaatagt gcccaccgtc cagaagtcacttccttctta gggtggttgc cccctggaat 9480 gctcttcaac aagccagagg gttaggaaaggaggggagga aagctgaaag aagacatctt 9540 tggtcaacag aggaaacata agagggagtggttttgcgga gggaaggagg ttagacagcc 9600 taaccttgag acaaccagag atcaaagcaatgtcctggat tctttaggtc agacagaaaa 9660 gaataaacta cccttgaaga gcttacattttaatgaggaa ctaaagaaga ttcatgaagt 9720 tgacaaggat atacaagtag aaagaactttcaaagattat ggagtaattg tgctagaggg 9780 aaggtaggtt gagctataat atcagaaacgttggtcctgg tgtggtcgtg gttaggtagc 9840 cttcaaattg gttgcaagca gagccttggttctccaagaa tgaaaggtag ggtcttgaag 9900 aaggcagggt ttgtaaggca tcctgcctcacctttttcct gccctcctcc accagcggta 9960 tctgacctca gaggcagagg aacaccaccagctcttcgat ctgattgaaa gcatgctaga 10020 gtatgaacca gctaagcggc tgaccttgggtgaagccctt cagcatcctt tcttcgcccg 10080 ccttcgggct gagccgccca acaagttgtgggactccagt cgggatatca gtcggtgacg 10140 atcaggccct gggcccccct gcatcttttatagcagtggg tgtccagtcc aggacactgg 10200 tgctttttta tacaagagaa cgagccagagttcactcctt cctcctggct ctctatatac 10260 ctgtgaatat gtgaaatagt gtaaatatgaaagaacttgt acctatcact tcaacccctg 10320 ccttgtacat aatactattc catccacacagtttccaccc tcacctgccc cctcatacgg 10380 agttggatgg gggccgagtg aggtaaccaggtggcatcta ccccatgttt tataaggaat 10440 tttgtacagt ctttgtgaaa taaaataacgtgcttcattt ga 10482

What is claimed is:
 1. An isolated nucleic acid comprising the nucleicacid set forth in the Sequence Listing as SEQ ID NO:
 2. 2. The isolatednucleic acid of claim 1 in a vector suitable for expressing the nucleicacid.
 3. The vector of claim 2 in a host suitable for expressing thenucleic acid.
 4. A polypeptide encoded by the nucleic acid of claim 1.5. An isolated nucleic acid encoding the polypeptide of claim
 4. 6. Apurified antibody which specifically binds to the polypeptide of claim4.
 7. An isolated nucleic acid comprising the nucleic acid set forth inthe Sequence Listing as SEQ ID NO:
 4. 8. The isolated nucleic acid ofclaim 7 in a vector suitable for expressing the nucleic acid.
 9. Thevector of claim 8 in a host suitable for expressing the nucleic acid.10. A polypeptide encoded by the nucleic acid of claim
 7. 11. Anisolated nucleic acid encoding the polypeptide of claim
 10. 12. Apurified antibody which specifically binds to the polypeptide of claim10.
 13. An isolated nucleic acid comprising the nucleic acid set forthin the Sequence Listing as SEQ ID NO:
 6. 14. The isolated nucleic acidof claim 13 in a vector suitable for expressing the nucleic acid. 15.The vector of claim 14 in a host suitable for expressing the nucleicacid.
 16. A polypeptide encoded by the nucleic acid of claim
 13. 17. Anisolated nucleic acid encoding the polypeptide of claim
 16. 18. Apurified antibody which specifically binds to the polypeptide of claim16
 19. An isolated double-stranded nucleic acid consisting of 1) asingle-stranded DNA which has a molecular size of 6.4 Kb and is derivedfrom humans, and 2) a DNA complementary to the single-stranded DNA,giving the restriction pattern corresponding to the propin1 gene shownin FIG.
 1. 20. A single-stranded RNA corresponding to thesingle-stranded DNA of claim
 19. 21. A single-stranded RNA correspondingto the complementary DNA of claim
 19. 22. A polypeptide encoded by theisolated double-stranded nucleic acid of claim
 19. 23. An isolateddouble-stranded nucleic acid consisting of 1) a single-stranded DNAwhich has a molecular size of 8.3 Kb and is derived from humans, and 2)a DNA complementary to the single-stranded DNA, giving the restrictionpattern corresponding to the cote1 gene shown in FIG. 1
 24. Asingle-stranded RNA corresponding to the single-stranded DNA of claim23.
 25. A single-stranded RNA corresponding to the complementary DNA ofclaim
 23. 26. A polypeptide encoded by the isolated double-strandednucleic acid of claim 23.